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CCAAACCCAG 


CTATCAACCC 


CATAAAGATA 


AAAAGAATGG 


TCTACTAAGC 


AAGCTTCCTT 


7440 


CCTCTGATTT 


TCAAACAAAA 


AT C T CT C ATT 


GCTTACATTG 


TTCTCTCAAG 


CTTATTGGTC 


7500 


ACT AT TAT C A 


ATATAGGTGG 


TTCTTACTAT 


CTCCAAGGAA 


TCTTGGATGA 


ATACATTCCA 


7560 


AAT C AG AT G A 


AATCAACTTT 


AGGAATCATC 


TCAGTTGGTC 


TGGTTATCAC 


CTATATCCTC 


7620 


CAACAAGTCA 


TGAGCTTCTC 


CAGAGATTAT 


CTCCTAACCG 


TTCTGAGTCA 


GAGATTAAGT 


7680 


ATTGATGTGA 


TTTTATCCTA 


TATTCGCCAT 


ATTTTTGAAC 


TTCCCATGTC 


TTTCTTTGCG 


7740 


ACACGTCGTA 


CAGGAGAAAT 


CAT TT C ACG A 


TTCACAGATG 


CTAACTCTAT 


TATAGATGCC 


7800 


TTGGCTTCTA 


CCATTCTTTC 


TCTTTTTCTG 


GATGTTTCTA 


TTCTGATTCT 


TGTAGGAGGC 


7860 


GTCTTACTGG 


CACAAAACCC 


TAATCTCTTC 


CTTCTTTCTC 


TTATTTCCAT 


TCCTATATAC 


7920 


ATGTTCATCA 


TCTTTTCTTT 


TATGAAACCT 


TTCGAAAAAA 


TGAACCATGA 


TGTCATGCAA 


7980 


AGTAATTCTA 


TGGTTAGCTC 


TGCCATTATC 


GAAGATATCA 


ACGGGATTGA 


AACTATAAAG 


8040 


TCGCTCACGA 


GTGAAGAAAA 


TCGCTATCAA 


AATATAGACA 


GCGAATTTGT 


AG ATT ATT T G 


8100 


GAAAAATCCT 


TTAAGCTCAG 


TAAATATTCT 


ATTTTACAAA 


C GAG T TT AAA 


GCAGGGAACA 


8160 


AAATTAGTTC 


TGAATATCCT 


TATCCTATGG 


TTTGGCGCTC 


AATTAGTCAT 


GTCAAGTAAA 


8220 


ATTTCTATCG 


GTCAGCTGAT 


TACCTTTAAC 


ACACTTTTTT 


CTTACTTTAC 


AACTCCTATG 


8280 


GAAAATATTA 


TCAACCTCCA 


AACCAAACTC 


CAATCTGCGA 


AGGTCGCTAA 


TAACCGTTTG 


8340 


AACGAAGTCT 


ATCTAGTCGA 


ATCTGAATTT 


CAAGTTCAAG 


AAAACCCTGT 


TCATTCACAT 


8400 


TTTTTGATGG 


GCGATATTGA 


ATTTGATGAC 


CTTTCTTATA 


AGTATGGTTT 


TGG AT GAG AT 


8460 


ACCTTAACAG 


AT ATT AAT CT 


C ACG AT T AAA 


CAAGGAGATA 


AGGTTAGCCT 


AGTTGGAGTT 


8520 


AGTGGTTCTG 


GTAAAACAAC 


TTTAGCCAAA 


ATGATTGTCA 


ATTTCTTTGA 


AC C CT AC AAA 


8580 


GGGCATATTT 


C CAT C AAT C A 


T C AGG AT AT T 


AAAAACATTG 


ATAAAAAAGT 


CTTGCGCCGT 


8640 


CATATTAATT 


ACCTACCCCA 


ACAAGCCTAT 


ATCTTTAATG 


GCTCTATTTT 


GGAAAACTT A 


8700 


ACCTTGGGCG 


GT AAT CAT AT 


GATTAGTCAA 


GAAG AT ATT C 


TAAAAGCTTG 


TGAAGTAGCT 


8760 


GAAATCCGTC 


AAGACATTGA 


AAGAATGCCT 


ATGGGCTATC 


AAACTCAGCT 


CTCTGATGGA 


8820 


GCTGGTCTAT 


CAGGAGGACA 


GAAGCAACGA 


ATCGCTCTCG 


CTCGTGCTCT 


TTTAACTAAA 


8880 


TCTCCTGTTT 


TAATACTAGA 


TGAAGCTACT 


AGCGGTCTTG 


ATGTCTTGAC 


TGAGAAAAAG 


8940 


GTTATAGATA 


ATCTTATGTC 


TCTAACTGAT 


AAAACC AT T C 


TCTTTGTAGC 


CCATCGTCTC 


9000 


AGTATAGCCG 


AACGAACCAA 


CCGTGTCATT 


GTTCTTGACC 


AGGGGAAAAT 


CATTGAAGTT 


9060 


GGTA 












9064 
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(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7780 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

CTCCATTTTT TTGATTTCAT AAATAAACAA CCTCTCTGTT AATTTTGTAT AATTATAACG 6 0 

ATATCCAAGT TACTTGTCAA GTGTTTTTTA AATTTTTATC TCAAAAATAT TTTTTCGTTC 12 0 

AAAAAAAGGA GCCATCAGTT GATTTCAAGC TCCCTTTTAT ACAGAATTAA ACTATTTTAT 18 0 

AGTTCGACAA TCTTACCTGT TTCAAAGTAG ACAACCCATT CACAGATATT TTTAGCATAG 240 

TCACCGATAC GCTCCAAGTA GGAAATAACT TGGAAATAAT CACGACCCGT AACAATGGCT 300 

TCTGGATTTT TCTTAATCTC TTCAGTCGCA AGGTCACGGA TAGTTTCAAA ATAGTGGTTA 3 60 

ATTTGCTCAT CCATGGAGGC C AC C CGGT AT GCGTCGTCAA CAGAACCATT AAGATAAAGA 42 0 

TCAAGTGCTG CTTCCACAAC GCTTTTAACT TCACGTCCCA TTTTTTTAAT TTCTTCCTCT 4 80 

ACAGCTGGAA TGCGCTCTTC CCCCTTCATA CGGATGGTTG CCTGGGCAAT GGCT AC AG C G 54 0 

TGATCCCCCA TACGCTCCAC ATCTGATACA GCCTTAAGGA CAGTCAAGAC TGTACGCAAA 600 

TCTTGAGAGA CTGGTTGTTG G AGTGC G AT C ATTTCAAATG ATTTCTTTTC CAGTTTCACT 6 60 

TCGTATTCAT TTACTTCTGC ATCATCTTCG ATGACCTCTT TTGCCAGGTC ACGGTCATGC 72 0 

GTGACAAAAG CACGTACCGT ACGATTGATT TGT G AG AG C A CTTCTTGTCC CATAGCGTAG 7 80 

AACTGGTTAT GTAATTTCTC TAAATCTTCT TCAAATTGAG AT CGT AAC AT CTTTCATCTC 840 

CTTATCCAAA TTTTCCTGTA ATATAGTCTT CCGTTTCCTT GTGTTGGGGA TCAAGGAACA 900 

TCTGCTTGGT ATC AT TAAAT TCAATCAAAT CTCCATCTAG GAAAAATCCT GTCTTATCAG 960 

AG AT ACGT G A AGCTTGCTGC ATGGAACGGG TTACCAGAAG CATGGTGTAC TTGTCTTTTA 102 0 

GACCATACAA GGTTTCCTCA ATTTTACCAG CTGAAATCGG ATCCAAAGCC GAAGTTGGCT 1080 

CATCCAAGAG GATGATTTTA GGACTAGTTG CCAAGACACG GGCCACGCAG ACACGCTGCT 1140 

GTTGACCACC TGACAATCCA AT AG CTGAAT CATATAGACG ATCCTTGACC TCATCCCAGA 12 00 

TAGAGGCACC TTGCAAGGCT TTTTCTACGG CTTCATCCAG AACCTGCTTA TCCTTAATTC 12 60 

CATTGATACG AAGCCCGTAG ACAACATTCT CATAGATAGT CATAGGGAAA GGATTAGGTT 132 0 

GTTGGAAAAC CATTCCGATT TCCTTACGTA ATTCAACCGT ATCTGTACGC GG AC TGT AG A 13 80 

TGTTGTGACC ATTGTACACC ACGGATCCAG TTGTGGTCAC CTCTGGATTG AGATCTCCCA 1440 
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TGCGGTTGAG AGACTTGAGG AGGGTTGACT TCCCTGATCC AGATGGACCA ATCAAGGCTG 1500 

TAATTTCCTT AGGTTGGAAA GATAGGGAAA CACTATTCAA AGCCTTCTTT TTATTATAAT 15 6 0 

AAACGGACAG GTCTGATACC TGTAAAATCG CATCTGTCAT ACGGTTTCCT TTCTAACCAA 1620 

AGTG AC C AG A T AC AT AG T CA TTGGTGGACT GTAGCTTGGC ATTTTGGAAA ATAGTTGCAG 1680 

TCTTGTCATA CTCAATCAAA TCACCCAAGT AAAAGAAGCC TGTATAGTCA CTTGCACGAG 1740 

CAGCCTGCTG CATATTATGC GTTACAATGA TGATGGTAAA GTTTTTCTTG AGCTCAAACA 1800 

TGGTCTCTTC TAGTTGCATG GTCGCAATCG GATCCAAGGC TGAGGCTGGC TCATCCATTA 1860 

AGAGGATATC TGGCTTAACA G AG AT GGC AC GAGCGATACA GAGACGTTGT TGCTGACCAC 1920 

CTGATAAGGT CAAGGCTGAC T TGTGGAG AT CGTCTTTAAC CTGATCCCAG AGGGCAGCCT 1980 

GACGAAGGGA GGTTTCTACG ATTTCATCTA GGACTTGCTT ATCCTTAACT CCAGCACGTT 2 04 0 

C ATGC G C AAA GGTAATATTA CGGTAAATTG ACTTAGCAAA TGGATTGGGA CGTTGAAAAA 2100 

CCATTCCAAT GTGTTTACGC ATTTCATAAA CGTTGATTTC TGGACGGTTG ACATCAATTC 2160 

CACGATAGAG AATCTGCCCA GTTACTTTAG CAATATCAAT AGTATCATTC ATGCGATTGA 222 0 

GACTGCGTAA GTAGGTAGAT TTCCCCGATC CCGACGGGCC AATCAAAGCT GTAATTTTAT 22 80 

TTCTTTCAAA TTGCATATCA ATCCCCTTAA TGGATTCATT TTTACCATAG TAAACATGGA 2 3 40 

CATCCTTAGT AGAAAGGGCT ACTTTTTCTT CAGGAAAGGT AAGGATATGC TTCTCATCCC 2400 

AGTTATATGT TGACATGGCT TCTCCTTTAG GCAGCGGTTA ATTTCTTGTG TAGATAGCTT 24 60 

CCGAACTTAC GAGCTCCAAA GTTAAAAATC AGGATAAAGA TCAGGAGCAC AGCGGCAGAA 2 52 0 

CCTGCTGATA CAATGGTTCC AT CTGG AAT A GTGCCTTCAC TATTGACTTT CCAGATATGG 2 580 

ACAGCCAAGG TTTCTGCTTG ACGGAAGATA GAGATGGGGC TAGTCACACT GAGGATATTC 2 64 0 

C AGT TAG AC C AGTCAAGAGC TGGCGCCGAT TGCCCTGCTG TAT AG AT C AG AGCTGCAGCT 2700 

TCGCCAAAGA TACGACCAGA TGCCAAGACG ACACCCGTTA CAATACCTGG AAGCGCTTCC 2 7 60 

GGAATAACAA CATGAACCAC TGTCTCCCAG CGAGAAATCC CAAGAGCCAG ACCAGCCTCA 2 820 

CGTTGGGTAT GGTGAACGTG TTTCAAACTA TCCTCTACAT TACGCGTCAT CTGAGGCAAG 2 8 80 

TTAAAGACTG TCAAGGCCAA GGCACCTGAA ATGATTGAAA AT C CAT AC T C AAACTGGACT 2940 

ACAAAGATCA AGTAACCAAA GAGACCCACC ACCACTGATG GTAAAGAGGA CAAAATTTCA 3000 

ATACAAGTCC GCACAAAGTT GGTAACAGGA CCTTTTTTAG CAT AT TC AG C CAAGTAAATC 3060 

CCAGCTCCCA TAGAAAGAGG TACAGAAATA ATCAAGGTAA TGAC CAATAG GAAAAAGGAA 3120 

TTGTAAAGCT GAATGCCAAT CCCACCACCT GCTTGAAAAG CAGAAGACCT TCCAGTCAAG 3180 
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AAAGACCAAG AGATATGGGG CAAGCCCCGA ACCAAGATAT AGAGAATCAA GGAAGCCAAG 3240 

ATTGTCACAA TGATGCTAGC AAT CGT AT AG AGGACAGCTG TTGCAAGTTT ATCTAATTTC 3 3 00 

TTAGCGCGCA TAATTTTTCT TTCCTCTTTC TTTCGTAATC AATTTAATCA CACTGTTAAA 33 60 

AACTAAGCTC ATCAAGAGCA GTACCAAGGC CAGTGACCAG AGAACATTAT TATTTACAGT 342 0 

TCCCATGACA GTGTTCCCAA TTCCCATAGT TAATATAGAA GTTAAAGTTG CAGCTGGTGT 3480 

GGTCAAGGAA GTTGGGATAA CAGCTGAGTT TCCGACAACC AT C T GG AT AG CTAGAGCCTC 3 54 0 

AC C AAAGGC A CGCGCCATCC CAAAGACCAC TGCAGTGAAA ATACCAGAAC GGGCCGCCTT 3 600 

CAAGATCACA CGCCAGATAG TCTGCCAGCG AGTGGCTCCC ATAGCGAAAC TGGCTTCACG 3 660 

ATAATAACGA GGAACCGCAC GCAAGCTATC CGTTGTCATA AAGGTTACGG TCGGCAAAAT 372 0 

CATGACAAAG AGGACGGAAA TCCCTGACAA AATCCCAAAA CCAGTCCCAC CAAAGACACT 3780 

GCGAACAAAG GGAACGACGA CTTGCAAGCC AATAAATCCG T AC AC T ACTG AAGGAATCCC 3 840 

AACCAGGAGT TCAATAGCTG GTTGCAAAAT CTTCGCCCCT TTTGGTGATA CTTCGGTCAT 3 9 00 

AAAAACTGCT G C AC C AAT AG CAAAGGGTGT TGCGATAAGG GCTGAGAGAA TGGTAACGAT 3 9 60 

AAAGGAACCC AAAAT CAT AG GAAGGGCACC AAATTCTTTA CTAGAAGGAT TCCAAGTTCC 402 0 

TCCCAAAAGA AAGTCAAAGA TATTCACACC ATTGACAAAG AAGGTCGACA AGCCTTTTTG 4 080 

CGCTACGAAA AC C AAAAT C A TGGCCACAAG GATGACTATC AAAGAAAGAC AGGCAAAGGT 414 0 

CAAACCTTTT CCTAATTTCT CCAGACGAGA ATTCTTTGAT GGAAGCAACA TTTTCTTAGC 42 00 

TAATTCTTCT TGATTCATTA TTGTCTCCCT TCCAACACTG TCACAGTTCC GGCAGCATCT 42 6 0 

TTTTCAACCT TCATTTCCTT AAT CGGAAT A TACTTCAATC CTTTGACAAT CCCTTCTTGG 43 20 

GTCTCATCCG AGAGAACAAA ATTGAGAAAT TCTGCAGCCA ACTCATTGGG CTGCCCCAAT 4 380 

GTATACATAT GCTCATAAGA CCACAAGGGC CAATTATTGC TACTTATATT TTCTGGACTT 4440 

AAGTCATAGC CATTCAACTT CATGCTTTTG ACCGAATCAT CTATATAGGT AAGAGATAAA 4 5 00 

TAAGAGATAG CTCCTGGACT TTTTGATACG ATTGATTTTA CCGCTCCATT TGAATCCTGC 4 56 0 

TCCTGACTTT GCATGGCAGA CTGACCTTCC ATAATGACAG TATCAAAGGT AGCACGAGAG 4 62 0 

CCAGAGCCGG CTGCCCGATT GATAACAGAG ATGGGTAAGT CCTTACCACC AACCTCTTTC 4 680 

CAATTGGTTA CCTCACCTAT GAAGATTTGA CGAAGTTGCT CTGTCGTTAG GTTATCAACA 474 0 

TCAACCTCCT TATTGACAAT CAGAGCCAAG CCAGCTACCG CGACCTTGTG GTCAACAAGA 4 800 

GC AG AAG CAT CAATTCCGTC TTTTTCCTCA GCAAATACAT CTGAGTTTCC TATATCAACT 4860 

GCCCCAGACT GAACCTGGGA CAAGCCTGTA CCAGAACCTC CCCCTTGGAC ATTGACCGTT 4920 

TTTCCAACAT GGATCGTGCC AAATTC AT C T GCCGCTACTT CAACCAAGGG. TTGCAAGGCA 4980 
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GTTGAGCCAA CAGCCGTTAT GGATTCTCCA CGATCAATCC AGCTAGCACA GCCTACTAAA 5 040 

CAAGCCGTCA GCCAAAAAGC GATAAGAGAC AGAGCAAGCT TTTTTCTTTT TTTCACTGTT 5100 

TTTCTCCTCG AAAATAATTA TGAATACTGT GAATTTTTTA AGTAGTTCTT TATGAGTTGA 5160 

CGCATGAATT CTTACCAAAT TTCTGCGCAA TTGATTATTT ATATAATATA GGCTATATTA 52 2 0 

CTCTTTCCTA ACCTCCTTTT TTCATATGTG GATAAAATCT CTTGTCTATC CCTTCCCCCA 5280 

TTGTCACCCA TTATAGTCAT TTCGTGTCTC TTTTTCCCCT TTTTAATGCA AGGGAAATTA 534 0 

CTCTCCTTAG ATGATAATCC AAAAG CT AG A AAGGTATCTC AAACCTCTCT ACTCTCCCAG 5400 

ACTAGTTTAC AACTAAAAGG AAAAG AT T CT ATTTTATGAG AAATCTAGTT TACAAGCGGT 5460 

AAGAACGCTA ATAACTAAAC TTCTTGTACT CTTTGAAAAT CTCTTCAAAC CAGTGTTTTG 5520 

AG C TAT C TAT GGCTAGCTTC CTAGTTTGCT CTTTGATTTT CATTGAGTAG TAAAACTACA 5580 

TGTAATGGCA ATCAAGATAT CAAGAATCAT CCTACTAAAA AAAT C CAT AC TTTCACTATA 5 64 0 

ACATAGAATA AG AT ATT T G A CTAGCATTTT CATTTGAATC TGAGGCCTTT TGGAAAATAA 57 00 

TTTTTCAAAA CATTTCCAGT AACCTTTGCA AAGCCCAAGC CATTGCCTTT AACCAAAACT 57 60 

TGGTACCAAC CATTTGGCAG ACTTTCTGCC AGCTGAACGG TTTCTCCAGC CGCATACTTG 5820 

ACAAACGCTT CTTGGCCAAT TTCAACCGAC TGTTCGACCT GACTCGGTTT CAAGGCTAAA 5880 

CCAAGAGCGA AACTGGGCTC AAAGCGTTTC TTCTTAAAAG TACCCAGATG CAGTCCATTG 5940 

CGAGCAATCT TGAGCTTCCA TAAATCTGGC AAAAGTTCTG G C AAG AG AT A AAGCTGGTCT 6000 

CCAAAAATCT G C AAG AT AC C CGGTAGATTG ACCTTCAAAT GGTTTTGGGC AAATTCCTGC 6060 

CACAAGGCAA CTTGTTCACG GCTGAGGTTA CTCTTACTTG CCTTAAATTT AGGAGCTGGA 6120 

TTGTTACCCT TAAACTGTAG ATGGGCAACA AACTGACCCT CTCCCTTAAA CTGATGAGGA 6180 

TACATCCGAG CCGTTTCTGG CAGGTCAATA CCAGCTACCA TTCCATTGAT ATGCTCTACT 62 40 

GGCAACAAGT CAAAATCATA CTCTTCCAGC AACCAATTGA CAATCTCTTC GTTTTCCTCG 6300 

GGTGCCCAGG TACAGGTCGA ATAAACCAGA TGACCACCTT CAGCTAACAT GGTCACTGCA 63 60 

TCCTCCAGAA TTTCTCTTTG CAAGCTAGCA CATTGACTCG GATAATCTAA GCTCCAATAG 6420 

TCCATAGCAT CAGGTTGCTT ACGAAACATT CCTTCACCAG AGCAAGGGGC AT C AAG AACG 6480 

ATTAAGTCAA AATAGCCTTT AAAGACCTTG ACCAAGCGGT CGGCAGATTC ATTGGTCACC 6540 

ACGACATTTG TCGCTCCAAA ACGCTCCATG TTTTCAACCA AAATCTTAGC CCGTTTGCTT 6600 

GAAATTTCAT TGGAAnCAAG TAGCCCCTCC CCTGCTAGAT AGGCTGCCAG TTGAGTTGAT 6660 

TTGCCCCCCG GTGCAGCAGC CAAGTCCAAG ACCTTCATAC CAGGACTGGG TTGGGCTACT 672 0 
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TGAGCCACCA TTTGAGCAGC AGGTTCTTGC GAATAAACTA AACCTGTAGC ATGCTCAGGC 67 8 0 

GATTTCCCTG AAACCTTCCC ATAGTGGCCC CAAGGGGTTT GAGTAATGGC AT C AG AAAAG 684 0 

GAAAGTTGCT CTTCTTTTAA GGGATTGACC CGAAAGGCCG AAACCGCTTC CTCCTCAAAA 6 900 

GAGGCAAGAA AATCTCTTGC CTCATCTCCT AGTATCTCTT TAT ATT T T T C AACAAATCCT 6960 

TCTGGAAATT GCATTTAAGT TCTTTTCCTT TCGTAAATAT AGGACTGAAT TTCCTCCTGC 702 0 

ATCTCAAGAG GC AC CATC AT GACCGGCTGT CTGGTTTGAA AATCAGGAGC TTCACCAAAA 7 0 80 

AGGGTCACAA CCCGATAGCC CAGACTTTCC CCTAAAATAC TAGCTGCGGC ATAATCCCAT 714 0 

GGTTGCAGAT AAGTGAGATA GGTCAACAAA CGCCCTGACA AAATCTTGGC AAAACTAATG 72 00 

GCCGCACTTC CATAGACACG AACACCAAGA ACCGCTCGGC TCAAATCAGC CAGCCCCCAT 72 60 

TCATTGGTTT CCAGCATACC ACTATTCCCT GCAATGAGAA AATCTCCAAG TGGTTTAGTT 73 2 0 

TTAAAAGGAG CTAGGGACCT AT CAT T TAG A CAAACTGGAA ATTCCCCACC ACCGTGGTAA 7 3 80 

CAATCCCCTT TGACCACATC ATAAATCAGA CCAAACTGTC CCTGACCATT TTCAAAATAA 744 0 

GCCATCATAA CAGCAAAATC TTCCTGCTGG GCTACAAAAT TATTGGTACC ATCAATGGGA 7 500 

TCAATGACCC AAACCTTGCC CTCTTGAACC GAGGCTCGCA GACAACCTTC T T C AG C AC AA 7 56 0 

ATCTTATCCT CAGGATAACG GGACAAAATC TCACCAACCA AGAGTTCCTG AACTTCTTTG 7 62 0 

TCCAGTCTGG TCACCAAATC TGTTGGAGAG GACTTGGTTT CAACACGCAA GTCTTCCTGC 7 68 0 

ATATGGTCAA GAATGTACTG ACCTGCTTTC TTAACAAGCT CTTTAGCAAA TTCAAATTTA 7 740 

CTTTCCAAGA GAAATCTTTC CTTCCCCTTT TTCTTTGGGG 7 780 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4820 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
GTAATGATAT AGGAACACCA GGTGACCTGA TGGGACGTCG TAAGCCTATG AACTACTAGC 60 

TGCTAAAGGC TTTAAAGATG GTATGGTACC AT AT ATCT C A AACCAATACG AAGAAGAAGC 12 0 

CAAACAAAAG GGCAAGACAA TCAATCTCTA CGGTAAAACA AGAGGTTTGG TTACAGATGA 180 

CTTGGTTTTG GAAAAGGTAT TTAATAACCA ATATCATACT TGGAGTGAGT TTAAGAAAGC 2 40 

TATGTATCAA GAACGACAAG ATCAGTTTGA TAGATTGAAC AAAGTTACTT TTAATGATAC 3 00 

AACACAGCCT TGGCAAACAT TTGCCAAGAA AACTACAAGC AGTGTAGATG AATTACAGAA 3 60 
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ATTAATGGAC 


GTTGCTGTTC 


GTAAGGATGC 


AGAACACAAT 


TACTACCATT 


GGAATAACTA 


420 


CAATCCAGAC 


ATAGATAGTG 


AAGTCCACAA 


GCTCAAGAGA 


GCAATCTTTA 


AAGCCTATCT 


480 


TGACCAAACA 


AATG AT TTT A 


GAAGTTCAAT 


TTTTGAGAAT 


AAAAAATAGT 


GTC TACT ATT 


540 


AGGAAATAAA 


GTTTAAAAAG 


GTGATGAAGA 


ACAAACCAAG 


ATTCAAGCAG 


GAATTCCTAC 


600 


TGATAATGAA 


GTAAGTTATG 


ATCTTATTTA 


TCAGCAGGAA 


ACTCTTCCTG 


CAACAGGTTC 


660 


ATCAACTTCT 


GAGCTTACAG 


CTTTAGGCCT 


ATTAGCTGTT 


GGTAGTTTAG 


TTCTTTTGGT 


720 


TCATAATATG 


ACGGGAACAG 


TTTTTTGCTC 


CCTCTGAAAA 


GTCATCATTT 


GATGGCTTTT 


780 


TTCTATATAG 


GGTAAAAGAT 


AGGGTAAAAG 


GCTATCATCG 


GACAAAATAA 


AGAAGGCATG 


840 


ATATAATATA 


AAGT AG AT T T 


CTATGTCATA 


AAACAAGAAC 


TGTTTGGACA 


TCATTCATTT 


900 


GAAAACTCTC 


TATGTTCAAA 


CAATAGTAAA 


ATAAAATAGG 


GGATCTAAAT 


CCTTGCTATG 


960 


AAAGGAAAAA 


ACTCAATGGC 


TACTATTCAA 


TGGTTTCCTG 


GTCACATGTC 


TAAAGCTCGT 


1020 


CGACAGGTGC 


AGGAGAATTT 


AAAATTTGTT 


GATTTTGTGA 


CGATTTTAGT 


AGATGCACGC 


1080 


TTGCCTCTAT 


CTAGTCAAAA 


TCCTATGTTG 


ACCAAGATTG 


TTGGTGATAA 


ACCAAAACTC 


1140 


TTGATTTTAA 


ACAAGGCCGA 


CTTGGCTGAT 


CCAGCAATGA 


CCAAGGAATG 


GCGTCAGTAT 


1200 


TTTGAATCAC 


AAGGAATCCA 


GACGCTAGCT 


ATCAACTCCA 


AAGAGCAAGT 


GACTGTAAAA 


1260 


GTTGTAACAG 


ATGCGGCCAA 


GAAGCTCATG 


GCTGATAAGA 


TTGCTCGCCA 


GAAAGAACGT 


1320 


GGG ATT C AG A 


TTGAAACCTT 


GCGTACTATG 


ATTATCGGGA 


TTCCAAACGC 


TGGTAAATCA 


1380 


ACTCTGATGA 


ACCGTTTGGC 


TGGTAAAAAG 


ATTGCTGTTG 


TTGGAAACAA 


GCCAGGGGTC 


1440 


ACAAAAGGTC 


AACAATGGCT 


TAAAACCAAT 


AAAGACCTGG 


AAATCTTGGA 


TACACCGGGG 


1500 


ATTCTCTGGC 


CTAAGTTTGA 


GGATGAAACT 


GTTGCACTTA 


AGTTGGCATT 


GACTGGAGCT 


1560 


ATCAAAGACC 


AGTTGCTTCC 


TATGGATGAG 


GTTACCATTT 


TTGGTATCAA 


TTATTTCAAA 


1620 


GAACATTATC 


CAGAAAAGCT 


GGCTGAACGC 


TT C AAAC AAA 


TGAAAATTGA 


AGAAGAAGCG 


1680 


CCTGTGATTA 


TTATGGATAT 


GACCCGCGCC 


CTCGGTTTCC 


GTGATGACTA 


TGACCGTTTT 


1740 


TACAGTCTCT 


TCGTGAAGGA 


AGTCCGTGAT 


GGCAAACTCG 


GTAACTATAC 


CTTAGATACA 


1800 


TTGGAAGACC 


TCGATGGCAA 


CGATTAAAGA 


AATCAAAGAA 


TTCCTTGTGA 


CAGTCAAGGA 


1860 


GTTAGAAAGC 


CCTATTTTTT 


TAGAGCTTGA 


AAAGGATAAT 


CGCTCAGGAG 


TTCAAAAGGA 


1920 


AATCAGCAAG 


CGTAAAAGAG 


CCATTCAAGC 


TGAATTAGAT 


GAAAATTTGC 


GCTTGGAATC 


1980 


CATGCTTTCT 


TATGAAAAAG 


AACTTTATAA 


GCAAGGATTG 


ACCTTAATTG 


CAGGTATTGA 


2040 


TGAGGTTGGT 


CGTGGTCCTC 


TTGCTGGTCC 


TGTAGTCGCT 


GCGGCCGTTA 


TTTTATCTAA 


2100 
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AAATTGTAAG 


ATTAAAGGTC 


T C AACG AC AG 


CAAGAAAATT 


CCTAAAAAGA 


AAC AT CTGG A 


2160 


GATTTTCCAA 


GCCGTTCAAG 


ACCAAGCCTT 


GTCGATTGGA 


ATTGGTATCA 


T AG AT AAT C A 


2220 


GGTCATCGAC 


CAAGTCAACA 


TCTATGAAGC 


AACCAAACTA 


GCCATGCAAG 


AAGCAATCTC 


2280 


CCAGCTCAGC 


CCTCAACCAG 


AGCACCTTTT 


GATTGATGCC 


ATGAAACTGG 


ACTTGCCCAT 


2340 


TTCACAAACC 


TCCATTATCA 


AAGGAGATGC 


CAACTCCCTC 


TCTATCGCAG 


CAGCATCTAT 


2400 


AGTAGCCAAG 


GTAACACGTG 


ATGAATTGCT 


GAAAGAATAC 


GATCAGCAGT 


TCCCTGGCTA 


2460 


TGATTTCGCT 


ACTAATGCAG 


GATATGGCAC 


AGCTAAACAT 


CTGGAAGGCC 


T C AC AAAACT 


2520 


AGGAGTTACC 


CCAATTCACC 


GAACCAGCTT 


TGAACCCGTT 


AAATCACTGG 


TTTTAGGTAA 


2580 


AAAAGAAAGT 


TAATTGAAAG 


GAAATAACAT 


GGAGGAACAG 


TCGGAAATAG 


TCCGTTCTAA 


2640 


GAAAGAATTC 


GCCTTTGCAT 


CCAGCACTAT 


ACTATCCCAA 


GTTGGTCGAG 


GAATCATTGT 


2700 


CGGCCTCATC 


GTTGGAATTA 


TCGTCGGATC 


CTTTCGTTTC 


TTAATTGAAA 


AGGGCTTCCA 


2760 


CCTGATACAA 


GGAGTTTATC 


AAGATCAAGG 


GTACTTAGTG 


CGCAATCTTT 


TTGTACTGGT 


2820 


TTTGTTTTAT 


ATACTCATCT 


GTTGGCTCAG 


TGCCAAACTA 


AC ACGGT C AG 


AAAAAGATAT 


2880 


TAAAGGCTCA 


GGAATTCCTC 


AAGTCGAAGC 


CGAACTGAAA 


GGCCTCATGT 


CCCTCAACTG 


2940 


GTGGGGCATT 


CTTTGGAAAA 


AATATGTGCT 


AGGTATTCTT 


GCTATTGCCA 


GTGGACTCAT 


3000 


GCTGGGTCGA 


GAGGGACCCA 


GCATTCAACT 


TGGAGCAGTT 


GGTGGTAAAG 


GAATTGCCAA 


3060 


GTGGCTCAAA 


TCCAGTCCAG 


TAGAGGAACG 


TTCCTTGATT 


GCCAGTGGAG 


CTGCAGCAGG 


3120 


TTTAGCCGCA 


GCCTTTAATG 


CTCCTATTGC 


AGCACTTCTC 


TTTGTTGTAG 


AAGAAGTCTA 


3180 


TCACCATTTT 


TCGCGCTTTT 


TCTGGGTCTC 


AACTCTAGCA 


GCCAGCATCG 


TAG C AAACTT 


3240 


TGTGTCTCTA 


CTCATGTTCG 


GTTTGACACC 


AGTATTGGAT 


ATGCCAGATA 


ACATTCCTCC 


3300 


CATGACCCTA 


GATCAGTATT 


GG AT AT AT CT 


CGTCATGGGA 


ATTTTCCTTG 


GATTTTCAGG 


3360 


TTTTCTCTAT 


GAGAAAGCTG 


TATTAAACGT 


TGGAAGAGTT 


TATGACTTGA 


TTGGTCAAAA 


3420 


AATCCATTTG 


GATAGGGCTT 


ATTATCCCAT 


CTTGGCTTTT 






-> *± o U 


AATCTTCTTA 


CCTCAAATCA 


TTGGTGGCGG 


AAATCAGCTT 


GTCCTTTCTT 


TAACTGAACA 


3540 


AAATTTTAGT 


TTCCAAGTTT 


TATTAGCTTA 


CTTTTTAATC 


CGCTTTATTT 


GGAGTATGAT 


3600 


TAGCTATGGA 


AGTGGACTGC 


CAGGAGGAAT 


TTTCCTCCCC 


ATTTTAGCTC 


TTGGTTCTTT 


3660 


GCTTGGTGCC 


TTAGTTGGTG 


TT AT CTGTGT 


CAATCTTGGA 


CTTGTCAGTC 


AAGAGCAATT 


3720 


CCCTATATTT 


GTCATTCTAG 


GAATGAGTGG 


CTATTTTGGA 


GCCATATCAA 


AAGCTCCCTT 


3780 


AACCGCTATG 


ATCCTCGTAA 


CTGAGATGGT 


AGGAGATATT 


CGCAACCTTA 


TGCCACTTGG 


3840 


TCTTGTCACT 


CTTGTTTCTT 


ATATTATCAT 


GGATTTGCTC 


AAAGGTACGC 


CAGTCTATGA 


3900 
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AGCCATGCTG GAAAAAATGC TTCCAGAAGA AGTATCTAGC GAAGGAGAAG TTACACTTAT 3 960 

CGAAATACCA GTTTCTGATA AAATTGCTGG GAAACAAGTT CATGAACTCA ACTTACCACA 402 0 

CAACGTCCTC ATCACAACTC AAGTCCATAA TGGCAAGAGC CAAACAGTTA ACGGCTCAAC 4 080 

CAGAATGTAT CTGGGTGATA TG ATT C AC C T GGTTATTCCA AAAAGTGAAA TTGGAAAAGT 414 0 

CAAAGATTTG TTGTTGTAGT ATGAGTATTT ACATAATTTA TGTTATGTAA ATGATCAGTT 4200 

TGATTTATTT AGAAAACCGA TTCTCAGGAA TGAGATCGGT TATTTTTTAC TGATGAGGAA 4260 

TTTTACATAT AAATAATTGA ACTTTATTAA AAATAAGACT ATAATTAAGT TAGAAATGAT 43 2 0 

AAAGTATAAA GCTAGAAAGG AGTTTACTGT ATCAAATCTG TACAGTAAGA TT AAAAT CAT 43 80 

GAAAAAGAAA ACAATAGCAA T T ATAT AG AG AAATGAAATA GAAATAGGAT AAAACAATCA 4440 

GGACAATCAA ATCAATTTCT AGCAATGTTT TAGAAGTCCA GATGTACTAT TCTAGTTTCA 450 0 

AT C TAT TAT A CAATGTGTTT TGTATCTCAT AGCTCCTTAT ATAGCTCTTC AGTTATGTAG 4 560 

TATTAACAGA AGTTTAGTGG GTGAGATTTT TATTATTTTC CTTATTCTGT TTTGTTTGTA 4 62 0 

GGTCTAAGTC T T T TT AT C AC TTTGAAAAAC TCCTATAACA TCTTTCCGAA AAACTATAAT 4 680 

TTTCTTGAAA AATATACAAG TCTATGCTAT ACTACTAGTA TACTTACTTA T GG AG AAAAT 474 0 

ACATGAAACG TGAGATTTTA CTGGAACGAA TCGACAAACT AAAACAACTC ATGCCCTGGT 4 800 

AAGTTCTGGA AT ACT AC C AA 482 0 



(2) INFORMATION FOR SEQ ID NO : 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21338 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
<D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

CT AC G AC AT C ATGATTAACA GTCATGCGCT ACTACCAACT GAGCTATGGC GGATAAAATA 60 

GTCCGTACGG GATTCGAACC CGTGTTACCG CCGTGAAAAG GCGGTGTCTT AACCCCTTGA 12 0 

CCAACGGACC TTCTATCTGT AGCAGATATA ACCATTATAT CAATTTCTTG CTAATTGTCA 18 0 

ATCACTTTTG AGATTTTTTC TCTAAAATAT CTTTTAATTT TCTAATTTTT AATCTTGAAA 240 

TAG G AC AACG ATGGTCTTCA TAGAAAACAA TTTCTAAGTT TTTTCGATCA ATTTCTCTGA 3 00 

TATTACCTAT ATTTACCAAA AATGACTTGT GAGGAGAATA AAATCGCTGA GTATGTTTGT 3 60 

CCTTTTCCTG AATATCTGTC ATGGTACCAT AAAACTCTTT TG C AAAAT T C TTACCAATAA 42 0 
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TGCGCAATTT 


ATGAGATACC 


CCTGTTGTTT 


CAATATACAA 


AATATCATGG 


TAAGGAATTT 


480 


TTAAATCATT 


TCCCTTGTAA 


TTGTAGTCGA 


AATAATCTAC 


AACATCTTCA 


TTTTCAAGTA 


540 


ACATACTCTT 


CGTGTAGAAG 


ATATTTTGCT 


CAATTCTCTT 


CTTAAACATC 


TCATCATTGA 


600 


TATCCTTATC 


AACAAAATCT 


AGGGCTGATA 


CCTGGTATTT 


ATAGGTTAGA 


GTCGCAAACT 


660 


CTGATCGACT 


AGTGATAAAG 


ACGATAATAG 


CGTAAGGATT 


GTAATGACGA 


ATGAGCTGAG 


720 


CCACTTCAAA 


TCCCTTTTTC 


TCAATTCCAT 


GAATATCGAT 


AT CT AGG AAA 


TAAAGCTGAT 


780 


T T ACTT CAT C 


ATTTTCAATG 


TATTCTTCAA 


ATTCACGGAC 


TTTTCCCGTT 


GTCTTGTATG 


840 


ATATTGGAAT 


ATTCGATTCT 


TTCGAAATTT 


CATCCAATAT 


TCTCTCTAGT 


CTCACTTGAT 


900 


GTTCAATAAC 


ATCTTCTAAA 


ATTAAAACTT 


TCATTCAAAT 


TCCCTCTTAA 


ATCTAATGAT 


960 


TTGTCTAAAT 


GTACTGCCTT 


CCATCTCTGT 


TTCTAAAATA 


ATATTGTTGT 


ACTTATCTAG 


1020 


TAGTTCTTTC 


ACATTATTTA 


ATCCGACTCC 


GCGATTTCTT 


CCCTTAGTGG 


AGAATCCTAA 


1080 


GGCAAATAGA 


TCTCCTGAAG 


GAGTCATCGT 


CATTTTACAT 


GAATTCTGAA 


TCACAATAAC 


1140 


TGTTTCAGTT 


TCCATCTTAA 


TAACTGCTAC 


TTCCATCTGC 


TTTTTATAGC 


TATCAGCCGA 


1200 


TCCTTCGACA 


GC ATT AT T C A 


ATAAAACGCT 


CATGATACGA 


ACCAAATCCA 


ATAGTTCAAT 


1260 


TGGAAGCTTG 


GTAATCGTAT 


CTTTTACTTC 


CAGTGTAAAC 


TCTACACCAT 


TATTTCGAGC 


1320 


ATAGACAATT 


GACTGAGCAA 


CCAAACTTCG 


TAAAGCTGAG 


TCTTCTATGT 


TGTTCAAATC 


1380 


AAAGTAAGTG 


TACTTATCTG 


AACGCAATTT 


ATGATTTGCT 


TTGACTAAAA 


CTTCATTGTA 


1440 


AATTCTGTCA 


ATTTCCTGTA 


AATTACCACT 


GTCAATTGCC 


ATCTGCATGC 


T G AC AAG CAT 


1500 


TCCAGCATAA 


TCATGTCGAA 


AACCACGGAT 


TTCATTATAC 


AGACCAACAA 


TTTCATCTGT 


1560 


GTAATTCTGT 


AAATGTTTCT 


GTTCAAATTT 


CTTCTGCTTC 


AAAGCAATCT 


CTTTCTCCAT 


1620 


TTGAACTTTA 


TGAGAATT C A 


TTGCAAAGAA 


GGTCAAAAGG 


AGAGAGATAA 


AGACAATAGA 


1680 


TGACAAAATA 


CTTCCAAAAC 


TATTCAAATG 


TTTAATCGTA 


CTTACCATAT 


CTGAAACGAA 


1740 


AGATACAATA 


TGTAGCAATA 


GTAAAGCAAA 


AAATACTTTT 


TTCAAGAAAG 


GATAAAGGTA 


1800 


GTCCTTGTCA 


AAATAGGCTA 


GTTCCAAATG 


GAAATAGTAA 


ATGATTTTTA 


ATGTAACAAA 


1860 


ATAGGTTAAC 


ACCGTCACAA 


CGAAAAAGAA 


TGGGAAATGA 


TATTGTAAAA 


CAAAATTATC 


1920 


TCCTGTTATA 


GAGGAGAAAA 


T T ACGG AC AG 


AAAGTTATGA 


GTGCTCTCAT 


ATAAAAGAGA 


1980 


TAGTAGTAAA 


CTTAGGAATA 


GTCCTCTATC 


CCTCTCATAC 


TGTTTCATCC 


AT C G AAAAT A 


2040 


GGAATATAAG 


CCCAAAGGAA 


AT AAAAAT C T 


TTCAATCCCT 


ATTTTATCTA 


AATATAGAAG 


2100 


ATAAAAGGAA 


AATTCAAGTA 


CTATTTCAGT 


TAGTAATGTA 


TAAGCACCAA 


AAACGTATAA 


2160 


TTCTTTTCTA 


TTTATTCGAC 


CTTTACAAAT 


TAAACGGTAA 


CTGTGACTAA 


TAATTAAAAA 


2220 
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ATGAACAATA 


ACTGTCCCAA 


ATCCAAGTAA 


ATCCATTACT 


CTTTCTCCTT 


AT T T C AT T AC 


2280 


TTTTTTCGTA 


GGAAAAGAAA 


ATCAAGGATG 


ATTCTTGAAA 


TCCTCATCTC 


CCCACCTTTA 


2340 


ATCTTTTGTA 


AGTCTTTTTC 


CTTCAAAGCT 


ACAAACTGTT 


CCAATTTAAC 


TGTGTTTTTC 


2400 


ATAATAAAAT 


CTCCTAAAAT 


GTTTTTTCTT 


GTAAGCTAAC 


T T AC AAAAAC 


CATTATACAA 


2460 


AATGGAATTT 


CGTTTTAGAT 


AAAATTCTCT 


C AACTGT C AT 


TTTTTTCTCC 


CAAAGTGTAC 


2520 


TTTTTTAAGA 


AAAAAGCCGG 


GAAAATTCCC 


AGCTTTGCTA 


TT AT AT T GAT 


CCCAGCAGGA 


2580 


TTCGAACCTG 


CGACCGTTCG 


CTTAGAAGGC 


GAATGCTCTA 


TCCAGCTGAG 


CTATGAGACC 


2640 


TAATACAATT 


ATTCTACCAA 


AAATTCAATT 


AAAAGTCAAT 


TTTCTATTTA 


TGGTAGGGGA 


2700 


ATCCCTGCTG 


AATCGTAAAA 


GCGCGATAGA 


TTTGTTCAAC 


AAGAACTAGT 


CTCATTAACT 


2760 


GATGGGGTAA 


GGTTAGGCGA 


CCAAAACTGA 


CAGAAAGATT 


GGCTCTATTT 


TTT AC AG AT G 


2820 


ATGATAATCC 


TAAACTTCCC 


CCAATAATAA 


AAGTAAGAGT 


AGAAAATCCT 


TTTATAGAAG 


2880 


TTTCTTCTAA 


CTGCTTACTA 


AATTCTTCTG 


AGAAGAAAGT 


TTTCCCTTCA 


ATGGCTAACA 


2940 


CAATAACGAA 


ATCACGGTCA 


GCAATTTTTG 


ATAAAATTCT 


CTGACCTTCT 


ATTTCTAAAA 


3000 


TCTTTTGATT 


TTCTGATTCA 


CTGGCCTTAT 


CTGGTGTTTT 


TTCATCTGAT 


AACTCAATCA 


3060 


TTTCAAACTT 


AGCAAATCTA 


GAAATTCGTT 


TTGAATACTC 


TGCGATACCA 


TCTTTTAAAT 


3120 


ACTTTTCTTT 


CAGTTTCCCA 


ACTGTTACAA 


CTTTAATTTT 


CATGACTCTA 


TTCTAACATA 


3180 


TTCTCTATTT 


TTTCACATCT 


TATTCACAAA 


ATAAAAAATA 


GATTTCAATT 


AAGAAAATCA 


3240 


C AATT T C AAA 


AGAGTTATCC 


ACAGTTTGTG 


TAAAACTTTT 


GTGTTTAAGT 


TAT AATT AAG 


3300 


CTAGTCAGTT 


TATACTTTCA 


GT AATT C AAA 


CATATGGAGG 


CAAATATGAA 


ACATCTAAAA 


3360 


ACATTTTACA 


AAAAATGGTT 


TCAATTATTA 


GTCGTTATCG 


TCATTAGCTT 


TTTTAGTGGA 


3420 


GCCTTGGGTA 


GTTTTTCAAT 


AACTCAACTA 


ACTCAAAAAA 


GTAGTGTAAA 


CAACTCTAAC 


3480 


AACAATAGTA 


CTATTACACA 


AACTGCCTAT 


AAGAACGAAA 


ATTCAACAAC 


ACAGGCTGTT 


3540 


AACAAAGTAA 


AAGATGCTGT 


TGTTTCTGTT 


ATTACTTATT 


CGGCAAACAG 


ACAAAATAGC 


3600 


GTATTTGGCA 


ATGATGATAC 


TGACACAGAT 


TCTCAGCGAA 


TCTCTAGTGA 


AGGATCTGGA 


3660 


GTTATTTATA 


AAAAGAATGA 


TAAAGAAGCT 


TACATCGTCA 


CCAACAATCA 


CGTTATTAAT 


3720 


GGCGCCAgCA AAGTAGATAT 


TCGATTGTCA 


GATGGGACTA 


AAGTACCTGG 


AGAAATTGTC 


3780 


GGAGCTGACA 


CTTTCTCTGA 


TAT TGCTGTC 


GTCAAAATCT 


CTTCAGAAAA 


AGTGACAACA 


3840 


GTAGCTGAGT 


TTGGTGATTC 


TAGTAAGTTA 


ACTGTAGGAG 


AAACTGCTAT 


TGCCATCGGT 


3900 


AGCCCGTTAG 


GTTCTGAATA 


TGCAAATACT 


GTCACTCAAG 


GTATCGTATC 


CAGTCTCAAT 


3960 
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AGAAATGTAT CCTTAAAATC GGAAGATGGA CAAGCTATTT CTACAAAAGC CATCCAAACT 4 02 0 

GATACTGCTA TTAACCCAGG TAACTCTGGC GGCCCACTGA TCAATATTCA AGGGCAGGTT 4080 

ATCGGAATTA CCTCAAGTAA AATTGCTACA AATGGAGGAA CATCTGTAGA AGGTCTTGGT 4140 

TTCGCAATTC CTGCAAATGA TGCTATCAAT ATTATTGAAC AGTTAGAAAA AAACGGAAAA 4200 

GTGACGCGTC CAGCTTTGGG AATCC AG AT G GTTAATTTAT CTAATGTGAG TACAAGCGAC 42 60 

AT C AG AAG AC TCAATATTCC AAGTAATGTT ACATCTGGTG TAATTGTTCG TTCGGTACAA 43 2 0 

AGTAATATGC CTGCCAATGG TCACCTTGAA AAAT AC GAT G TAATTACAAA AGTAGATGAC 4380 

AAAGAGATTG CTTCATCAAC AGACTTACAA AGTGCTCTTT ACAACCATTC TATCGGAGAC 444 0 

ACCATTAAGA TAACCTACTA TCGTAACGGG AAAGAAGAAA CTACCTCTAT CAAACTTAAC 4 500 

AAGAGTTCAG GTGATTTAGA ATCTTAATTG ACATCTATGT AAAGAAAGCT TTACATAAGA 4560 

GAAAAGATGT GTTAGTGTAG AATCATGGAA AAATTTGAAA TGATTTCTAT CACAGATATA 4 620 

CAAAAAAATC CCTATCAACC CCGAAAAGAA TTTGATAGAG AAAAACTAGA TGAACTAGCA 4 680 

CAGTCTATCA AAGAAAATGG GGTCATTCAA CCGATTATTG TTCGTCAATC TCCTGTTATT 474 0 

GGTTATGAAA TCcTTGCAGG AGAGAGACGC TATCGGGCTT CACTTTTAGC TGGTCTACGG 4 800 

TCTATCCCAG CTGTTGTTAA ACAGATTTCA GACCAAGAGA TGATGGTCCA GTCCATTATT 4860 

GAAAATTTAC AGAGAGAAAA TTTAAACCCA ATAGAAGAAG CACGCGCCTA TGAATCTCTC 4 92 0 

GTAGAGAAAG GATTCACCCA TGCTGAAATT GCAGATAAGA TGGGCAAGTC TCGTCCATAT 4 980 

ATCAGCAACT CCATTCGTTT ACTTTCCTTG CCAGAACAGA TTCTTTCAGA AGTAGAAAAT 5040 

GGCAAACTAT CACAAGCCCA TGCGCGTTCC CTAGTTGGGT TAAATAAGGA ACAACAAGAC 5100 

TATTTCTTTC AACGGATTAT AGAAGAAGAT ATTTCTGTAA GGAAATTAGA AGCTCTTCTG 516 0 

ACAGAGAAAA AACAAAAGAA AC AG C AAAAA ACTAATCATT TCATACAAAA TGAAGAAAAA 522 0 

CAGTTAAGAA AACTACTCGG AT T AG ATGT A GAAATTAAAC TAT CT AAAAA AGACAGTGGA 52 80 

AAAATCATTA TTTCTTTTTC AAATCAAGAA GAATATAGTA GAATTATCAA CAGCCTGAAA 534 0 

TAAGGCTGTT CTTTTATTTT TTTATCTCAC AAGGTTATCC ACTATGTTTT TCGAT AAAAA 5400 

GCTTAATAAA TCAATAATTT CTTCTTTTAT CCCCAACCTG TGGATAAAGT TTGGTAACAT 54 60 

TGTGGATTAT TTTTCACAGC TTGTGGAAAA TTCTTGCTAT CTATGGTAAA ATATCTCTAG 552 0 

TAT T AAACTT TTAAATAGTA AAGGAGGAGA AAGGATTGAA AGAAAAACAA TTTTGGAATC 5580 

GTATATTAGA ATTTGCACAA GAAAGACTGA CTCGATCCAT GTATGATTTC TATGCTATTC 5640 

AAGCTGAACT CATCAAGGTA GAGGAAAATG TTGCCACTAT ATTTCTACCT CGCTCTGAAA 57 00 

TGGAAATGGT CTGGGAAAAA CAACTAAAAG ATATTATTGT AGTAGCTGGT TTTGAAATTT 57 6 0 
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ATGACGCTGA AATAACTCCC CACTATATTT TCACCAAACC TCAAGATACG ACTAGCTCAC 5 82 0 

AAGTTGAAGA AGCTACAAAT TTAACTCTTT ATAACTATAG T C C AAAGTT A GTATCTATTC 5880 

CTTATTCAGA TACGGGATTA AAAGAAAAGT ATACCTTTGA TAACTTTATT CAAGGGGATG 5 94 0 

GAAATGTTTG GGCTGTATCA GCCGCTTTAG CTGTCTCTGA AGATTTGGCT CTGACCTATA 6000 

ACCCTCTTTT TATCTATGGA GGACCAGGCC TTGGTAAGAC T C AC T T ATT A AACGCTATTG 6060 

GAAATGAAAT TCTAAAAAAT ATTCCTAATG CGCGTGTTAA ATATATCCCT GCCGAAAGCT 612 0 

TTATTAATGA CTTTCTTGAT CACCTAAGAC TTGGGGAAAT GGAAAAGTTT AAAAAGACCT 6180 

ATCGTAGTCT TGATCTTTTG TTAATCGATG AT AT C C AGTC ACTCAGCGGA AAAAAAGTCG 6240 

CAACTCAGGA AGAATTTTTC AATACCTTTA ACGCCCTTCA TGACAAGCAA AAACAGATTG 6300 

TCCTAACGAG TGATCGTAGT CCAAAACATC TAGAAGGGCT CGAGGAGAGG CTTGTCACGC 63 60 

GTTTTAGTTG GGGATTGACA CAAACTATCA CCCCCCCTGA CTTTGAAACA CGTATTGCCA 642 0 

TTTTACAAAG TAAGACGGAA CATTTAGGCT ACAATTTCCA AAGTGATACT CTAGAATACC 64 80 

TAGCTGGGCA ATTTGATTCA AATGTTCGAG ATCTTGAGGG AGCCATCAAC GACATCACTT 6540 

TAATTGCCAG AGTAAAAAAA AT C AAGG AT A T C ACT AT T G A TATTGCTGCA GAAGCCATTA 6 600 

GAGCCCGCAA ACAAGATGTT AG C C AAATGC TCGTCATCCC AATTGATAAA ATCCAAACTG 6 660 

AAGTTGGTAA CTTTTATGGT GTTAGTATCA AAGAAATGAA GGGAAGTAGA CGCCTTCAAA 672 0 

ATATTGTTTT GGCCCGTCAA GTAGCCATGT ATTTATCTAG AGAACTAACA GAT AAT AGTC 67 80 

TTCCAAAAAT TGGGAAGGAA TTTGGGGGAA AAG AT CAT AC CACAGTCATT CATGCCCATG 6840 

CCAAAATAAA ATCTTTGATT GATCAAG AC G ATAATTTACG TTTAGAAATT G AAT C AAT C A 6900 

AAAAGAAAAT CAAATAATTT GTGGATAACT TTTAGTTTTT TATCTTTTTT ATCCACATTT 69 60 

TTTAAACAAG CTAAAAAACT TGATATGACT TGTTTAAAGG CTGTTTTCCA CAGATTTCAC 7 02 0 

AGACTCTATT ATTACTATTA TCTTTCTAAT ACTAAAAATA AATAAAGGAG AATCCATGAT 7 0 80 

TCATTTTTCA ATTAATAAAA ATTTATTTCT ACAAGCATTA AAT AC T AC T A AGAGAGCTAT 714 0 

TAGTTCTAAA AATGCCATTC CTATTTTATC AACAGTAAAA ATTGACGTGA CCAATGAAGG 72 00 

TATTACTTTA ATTGGTTCAA ATGGTCAAAT TTCAATTGAA AATTTTATTT CTCAAAAAAA 7 2 60 

TGAAGATGCT GGTTTGTTAA TTACTTCTTT AGGTTCGATC CTTCTTGAAG CTTCTTTCTT 732 0 

TATCAATGTA GTATCTAGTT TACCTGATGT AACTCTTGAT TTTAAAGAAA TTGAACAAAA 73 80 

TCAAATTGTT TTAACCAGTG GCAAATCAGA AATTACCCTA AAAGGAAAAG AT AG CGAAC A 7 440 

AT ATC C AC G A ATCCAAGAAA TTTCAGCAAG CACTCCTTTA AT ACT TG AAA CAAAATTACT 7500 
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CAAGAAAATT ATTAATGAAA CAGCCTTTGC TGCAAGTACA CAAGAGAGTC GTCCGATTTT 7 560 

AACAGGTGTC CACTTCGTAT TGAGTCAACA CAAAGAGTTA AAAACAGTTG C AAC AG AC T C 7 62 0 

TCATCGCCTA AGCCAGAAAA AATTGACTCT TGAAAAAAAT AGTGATGATT TTGATGTCGT 7 680 

AATTCCTAGC CGTTCTCTAC GCGAATTTTC AGCGGTATTT ACAGATGATA TCGAAACTGT 7740 

AGAGATTTTC TTTGCCAATA ACCAAATCCT CTTTAGAAGC GAAAATATTA GCTTCTATAC 7 800 

TCGTCTCCTA GAAGGAAACT AT C C TG AT AC AGATCGCTTG ATTCCAACAG ACTTTAACAC 7860 

TACTATTACT TTTAATGTGG TAAACTTACG CCAGTCAATG GAGCGTGCCC GTCTTTTATC 7 92 0 

AAGTGCGACT CAAAATGGTA CTGTGAAACT TGAAATTAAG GATGGGGTTG TTAGCGCCCA 7980 

TGTTCACTCT CCAGAAGTTG GTAAAGTAAA CGAAGAAATC GATACTGATC AGGTTACTGG 8040 

TGAAGATTTG ACCATTAGTT TCAACCCAAC TTACTTGATT GATTCTCTTA AAGCTTTAAA 8100 

TAGCGAAAAG GTGACTATTA GCTTTATCTC AGCTGTTCGT C CAT T T ACT C TTGTGCCAGC 8160 

AGATACTGAC GAAGACTTCA TGCAGCTCAT TACACCAGTT CGTACAAATT AAGTGAAAGA 8220 

GGTTGAGCCT GGCTCGCCTC TTTTATGATA TAATCGAAAA AGAAAAGGAG AGTAGTATGT 8280 

ATCAAGTTGG AAATTTTGTT GAGATGAAAA AATCACACGC TTGTACAATC AAGTCGACTG 83 40 

GTAAAAAGGC TAATCGTTGG G AAAT T AC AC GTGTAGGAGC AGATATCAAA ATAAAATGTA 84 00 

GTAATTGTGA GCATGTTGTC ATGATGGGGC GATATGATTT TGAGCGAAAA ATGAATAAAA 8460 

TTATTGACTG AGAACCCTTA GTTAGAGGGT TAGCACTTTA TCCCTTTTTG TGTTATAATA 8520 

TTAGGGATTG AAATGAAAAC GGAGAATGAG AAATATGGCT TTGACAGCAG GTATCGTTGG 8580 

TTTGCCAAAC GTTGGTAAAT CAACACTATT TAATGCAATT ACAAAAGCAG GAGCAGAGGC 8 640 

AGCAAACTAC CCATTTGCGA CGATTGATCC AAATGTTGGA ATGGTGGAAG TTCCAGATGA 87 00 

ACGCCTACAA AAACTAACTG AAATGATAAC TCCTAAAAAG ACAGTTCCCA C AAC ATT T G A 87 60 

ATTTACAGAT ATTGCAGGGA TTGTAAAAGG AGCTTCAAAA GGAGAGGGGC TAGGGAATAA 882 0 

ATTCTTGGCC AATATTCGTG AAGTAGATGC GATTGTTCAC GTAGTTCGTG CTTTTGATGA 8880 

TGAAAATGTA ATGCGCGAGC AAGGACGTGA AGACGCCTTT GTAGATCCAC TTGCAGATAT 8940 

TG AT AC C ATT AATCTGGAAT TGATTCTTGC TGACTTAGAA TCAGTGAACA AACGATATGC 9000 

GCGTGTAGAA AAGATGGCAC GTACGCAAAA AGATAAAGAA TCAGTAGCAG AATTCAATGT 9060 

TCTTCAAAAG ATT AAAC C AG TCCTAGAAGA CGGGAAATCA GCTCGTACCA TTGAATTTAC 912 0 

AGATGAGGAA CAAAAGGTTG TCAAAGGTCT TTTCCTTTTG ACGACTAAAC CAGTTCTTTA 9180 

TGTAGCTAAT GTGGACGAGG ATGTGGTTTC AG AAC CT G AC TCTATCGACT ATGTCAAACA 92 40 

AATTCGTGAA TTTGCAGCGA CAGAAAATGC TGAAGTAGTC GTTATTTCTG CGCGTGCTGA 9300 
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GGAAGAAATT TCTGAATTGA ATGATGAAGA TAAAAAAGAG TTTCTTGAAG CCATTGGTTT 93 60 

GACAGAATCA GGTGTAGATA AGTTGACGCG TGCAGCTTAC CACTTGCTTG GATTGGGAAC 942 0 

TTACTTCACA G CTGGTG AAA AAGAAGTTCG CGCTTGGACT TTCAAACGTG GTATGAAGGC 9480 

TCCTCAAGCA GCTGGTATTA T C C AC T C AG A CTTTGAAAAA GGCTTTATTC GTGCAGTAAC 9540 

C ATGT CAT AT GAAGATCTAG TGAAATACGG ATCTGAAAAG GCCGTAAAAG AAGCTGGACG 9600 

CTTGCGTGAA GAAGGAAAAG AATATATCGT TCAAGATGGC GATATCATGG AATTCCGCTT 966 0 

TAATGTCTAA AAATTAATAA ATGGTGTCAA TTAGGTTGGA AAAAAATTCC AACCCTTTTG 972 0 

GCTTTTGAAA GGAAAAATAA ATGACCAAAT TACTTGTAGG CTTGGGAAAT CCAGGGGATA 9780 

AATATTTTGA AACAAAACAC AATGTTGGTT TTATGTTGAT TGATCAACTA GCGAAGAAAC 984 0 

AGAATGTCAC TTTTACACAC GATAAGATAT TTCAAGCTGA CCTAGCATCC TTTTTCCTAA 9900 

ATGGAGAAAA AATTTATCTG GTTAAACCAA CGACCTTTAT GAATGAAAGT GGAAAAGCAG 99 60 

TTCATGCTTT ATTAACTTAC TATGGTTTGG ATATTGACGA TTTACTTATC ATTTACGATG 10020 

ATCTTGACAT GGAAGTTGGG AAAATTCGTT T AAGAG C AAA AGGCTCAGCA GGTGGTCATA 10080 

ATGGTATCAA GTCTATTATT CAACATATAG GAACTCAGGT CTTTAACCGT GTTAAGATTG 1014 0 

GAATTGGAAG ACCTAAAAAT GGTATGTCAG TTGTTCATCA TGTTTTGAGT AAGTTTGACA 102 00 

GGGATGATTA TATCGGTATT TTACAGTCTG TTGACAAAGT TGACGATTCT GTAAACTACT 10260 

ATTTACAAGA GAAAAATTTT GAGAAAACAA TGCAGAGGTA TAACGGATAA ATGGTGACCT 1032 0 

TAT TAG ATT T ATTCTCAGAA AATG AT C AG A TTAAAAAATG GCATCAAAAT TTAACAGATA 103 80 

AGAAAAGACA ACTAATACTT GGTTT AT C AA CAT CT AC T AA GGCTCTTGCA ATTGCAAGCA 1044 0 

GTTTAGAAAA AGAAGATAGG ATTGTGTTAT TGACGTCAAC TTATGGAGAA GCAGAAGGAC 10500 

TTGTTAGTGA TCTTATTTCT ATCTTGGGTG AGGAACTCGT CTATCCATTT TTGGTAGATG 10560 

ATGCTCCTAT GGTGGAGTTT TTGATGTCTT CACAGGAAAA AATTATTTCA CGGGTTGAAG 10 620 

CCTTGCGTTT TTTGACTGAT T CAT C T AAG A AAGGGATTTT AGTTTGTAAT ATCGCAGCAA 10680 

GTCGATTGAT TTTACCGTCT CCCAATGCAT TCAAAGATAG TATTGTAAAA ATCTCAGTTG 10740 

GTGAAGAATA TGATCAACAC GCGTTTATCC AT C AGTT AAA GGAAAATGGC TATCGAAAAG 10800 

TTACTCAAGT ACAAACTCAG GGCGAATTTA GTCTTCGAGG AGATATTTTA GATATTTTTG 10860 

AAATATCCCA GTTAGAACCT TGTCGAATTG AGTTTTTTGG TGATGAAATT GATGGTATCA 10920 

GGTCATTTGA AGTAGAAACA CAATTATCGA AAGAAAATAA GACAGAACTC ACTATCTTTC 10980 

CAGCTAGTGA TATGCTTTTG AGAGAAAAGG ATTATCAACG AGGACAGTCA GCTTTAGAAA 11040 
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AACAAATTTC 


AAAAACTTTA 


TCACCTATTT 


TGAAATCATA 


CCTAGAAGAA 


ATTCTTTCAA 


11100 


GTTTTCACCA 


AAAACAAAGT 


C ATGC AG AC T 


CTCGGAAGTT 


TTTATCTTTG 


TGCTATGATA 


11160 


AG AC AT GG AC 


TGTCTTTGAT 


TATATTGAAA 


AAGATACTCC 


AATATTCTTT 


G ATG AT TAT C 


11220 


AAAAATTGAT 


GAATCAGTAT 


GAAGTCTTTG 


AAAGAGACTT 


AGCGCAGTAC 


TTTACAGAAG 


11280 


AATTACAGAA 


TAGTAAAGCA 


TTTTCTGATA 


TGCAGTATTT 


TTCTGATATT 


GAACAAATCT 


11340 


ATAAAAAACA 


AAGTCCAGTG 


ACCTTTTTCT 


CTAATCTTCA 


AAAGGGTTTA 


GGAAATCTCA 


11400 


AATTTGACAA 


AATTTATCAA 


TTCAATCAAT 


ATCCTATGCA 


GGAATTTTTC 


AATCAGTTTT 


11460 


CTTTTCTAAA 


AGAAGAAATT 


GAACGATATA 


AAAAAATGGA 


TTACACCATT 


ATTCTGCAGT 


11520 


CTAGCAATTC 


AATGGGAAGT 


AAAACATTGG 


AGGATATGTT 


AGAGGAATAT 


CAGATTAAAT 


11580 


TGGATTCTAG 


AGATAAGACA 


AATATCTGTA 


AAGAATCTGT 


AAACTTAATA 


GAGGGTAATC 


11640 


TCAGACATGG 


TTTTCATTTT 


G T AG ATG AAA 


AGATTTTATT 


GAT AAC TG AA 


CATGAGATTT 


11700 


TTCAAAAGAA 


ATTAAAGCGT 


CGTTTTCGAA 


G AC AAC AT G T 


TTCAAATGCA 


GAGAGATTAA 


11760 


AAGATTACAA 


TGAACTTGAA 


AAAGGGGACT 


ATGTTGTCCA 


TCATATCCAT 


GGGATTGGTC 


11820 


AATATCTAGG 


AATTGAAACC 


ATTGAAATCA 


AGGGAATTCA 


TCGCGATTAT 


GTCAGTGTCC 


11880 


AATACCAAAA 


TGGTGATCAA 


ATTTCTATCC 


CCGTGGAACA 


GATTCATCTA 


CTGTCCAAAT 


11940 


ATATTTCAAG 


TGATGGTAAA 


GCTCCAAAAC 


TCAATAAATT 


AAATGACGGT 


CATTTTAAAA 


12000 


AGGCCAAGCA 


AAAGGTTAAG 


AACCAGGTAG 


AGG AT AT AG C 


TGATGATTTA 


ATCAAACTCT 


12060 


ACTCTGAACG 


TAGTCAGTTG 


AAGGGTTTTG 


CTTTCTCAGC 


TG ATG AT GAT 


GATCAAGATG 


12120 


CCTTTGATGA 


TGCTTTCCCT 


TATGTTGAAA 


CGGATGATCA 


ACTTCGTAGT 


ATTGAGGAAA 


12180 


TCAAGAGGGA 


TATGCAGGCT 


TCTCAGCCAA 


TGGATCGACT 


TTTAGTTGGG 


GATGTTGGTT 


12240 


TTGGAAAGAC 


TGAAGTTGCT 


ATGCGTGCAG 


CCTTTAAAGC 


AGTCAATGAT 


CACAAACAGG 


12300 


TTGTCATTCT 


AGTTCCGACG 


ACGGTTTTAG 


CGCAACAGCA 


CTATACGAAT 


TTTAAGGAAC 


12360 


GATTCCAAAA 


TTTTGCAGTT 


AATATTGATG 


TGTTGAGTCG 


CTTTAGAAGT 


AAAAAAGAGC 


12420 


AGACTGCAAC 


ACTTGAAAAA 


TTGAAAAACG 


GTCAAGTCGA 


TATTTTGATT 


GGAACACATC 


12480 


GTGTTTTGTC 


AAAAGATGTT 


GTGTTTGCTG 


ATTTGGGCTT 


GATGATTATT 


GATGAGGAAC 


12540 


AGCGATTTGG 


TGTCAAGCAT 


AAGGAAACTT 


TGAAAGAACT 


GAAGAAACAA 


GTGGATGTCC 


12600 


TAACCTTGAC 


CGCTACGCCA 


ATCCCTCGTA 


CCCTCCATAT 


GTCTATGCTG 


GGAATCAGAG 


12660 


ATTTATCTGT 


TATTGAAACT 


CCGCCGACTA 


ATCGCTATCC 


TGTTCAGACC 


TATGTTTTGG 


12720 


AAAAGAATGA 


TAGTGTCATT 


CGTGATGCTG 


TCTTGCGTGA 


AATGGAGCGT 


GGAGGTCAAG 


12780 


TTTATTATCT 


TTACAACAAA 


GTTGACACAA 


TTGTTCAGAA 


GGTTTCAGAA 


TTACAGGAGT 


12840 
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TGATTCCGGA GGCTTCGATT GGATATGTTC ATGGTCGAAT GAGTGAAGTC CAGTTGGAAA 12 900 

AT AC TC TAT T AGACTTTATT GAGGGACAAT ACGATATCTT GGTGACGACT ACTATTATTG 12 960 

AGACAGGGGT GGACATTCCA AATGCTAATA CTTTATTTAT TGAAAATGCG G AC C AT ATGG 13 02 0 

GCTTGTCAAC CTTATATCAG TTAAGAGGAA GAGTCGGTCG TAGTAATCGT ATTGCTTATG 13 080 

CTTATCTCAT GTATCGTCCA GAAAAATCAA TCAGTGAAGT CTCTGAAAAG AG AT T AGAAG 1314 0 

CGATTAAAGG ATTTACAGAA TTGGGCTCTG GCTTTAAGAT TGCAATGCGA GATCTTTCGA 13 200 

TTCGTGGAGC AGGAAATCTT TTAGGAAAAT CCCAGTCTGG TTTCATTGAT TCTGTTGGTT 13 2 60 

TTGAATTGTA TTCGCAGTTA TTAGAGGAAG CTATTGCTAA ACGAAACGGT AATGCTAACG 13 32 0 

CTAACACAAG AACCAAAGGG AATGCTGAGT TGATTTTGCA AATTGATGCC TATCTTCCTG 133 80 

ATACTTATAT TTCTGATCAA CGACATAAGA TTGAAATTTA CAAGAAAATT CGTCAAATTG 13 440 

ACAACCGTGT CAATTATGAA GAGTTACAAG AGGAGTTGAT AGACCGTTTT GGAGAATACC 13 500 

CAGATGTAGT AGCCTATCTG TTAGAGATTG GTTTGGTCAA AT CAT ACT TG GACAAGGTCT 13 560 

TTGTTCAACG TGTGGAAAGA AAAGATAATA AAATTACAAT TCAATTTGAA AAAGTC AC T C 13 620 

AACGACTGTT TTTAGCTCAA GATTATTTTA AAGCTTTATC CGTAACGAAC TTAAAAGCAG 13 6 80 

GCATCGCTGA GAATAAGGGA TTAATGGAGC TTGTATTTGA TGTCCAAAAT AAGAAAGATT 137 40 

ATGAAATTTT AGAAGGTTTG CTGATTTTTG GAGAAAGTTT ATTAGAGATA AAAGAGTCTA 13 800 

AGGAAGAAAA TTCCATTTGA TATTTTTCTT CTATAAAATA GATAAAAATG GTACAATAAT 13 86 0 

AAATTGAGGT AATAAGGATG AGATTAGATA AATATTTAAA AG TAT CG CG A ATT AT C AAGC 13 92 0 

GTCGTACAGT CGCAAAGGAA GTAGCAGATA AAGGTAGAAT CAAGGTTAAT GGAATCTTGG 13 9 80 

CCAAAAGTTC AACGGACTTG AAAGTTAATG ACCAAGTTGA AATTCGCTTT GGCAATAAGT 14 04 0 

TGCTGCTTGT AAAAGTACTA GAGATGAAAG ATAGTACAAA AAAAGAAGAT GCAGCAGGAA 14100 

TGTATGAAAT TATCAGTGAA ACACGGGTAG AAGAAAATGT CTAAAAATAT TGTACAATTG 14160 

AATAATTCTT TTATTCAAAA TGAATACCAA CGTCGTCGCT ACCTGATGAA AGAACGACAA 1422 0 

AAACGGAATC GTTTTATGGG AGGGGTATTG ATTTTGATTA TGCTATTATT TATCTTGCCA 142 80 

ACTTTTAATT TAGCGCAGAG TTATCAGCAA TTACTCCAAA GACGTCAGCA ATTAGCAGAC 143 4 0 

TTGCAAACTC AGTATCAAAC TTTGAGTGAT GAAAAGGATA AGGAGACAGC ATTTGCTACC 14400 

AAGTTGAAAG ATGAAGATTA TGCTGCTAAA TATACACGAG CGAAGTACTA TTATTCTAAG 14460 

TCGAGGGAAA AAGTTTATAC GATTCCTGAC TTGCTTCAAA GGTGATAAAA TGGAAAATTT 14520 

ATTAGACGTA ATAGAGCAAT TTTTGAGTTT GTCAGATGAA AAGCTGGAAG AATTGGCTGA 14580 
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TAAAAATCAA TTATTGCGTT TACAAGAAGA AAAGGAAAGG AAGAATGCGT AAATTCTTAA 14 64 0 

TTATTTTGTT GCTACCAAGT TTTTTGACCA TTTCAAAAGT CGTTAGCACA GAAAAAGAAG 14700 

TCGTCTATAC TTCGAAAGAA ATTTATTACC TTTCACAATC TGACTTTGGT ATTTATTTTA 147 60 

GAGAAAAATT AAGTTCTCCC ATGGTTTATG GAGAGGTTCC TGTTTATGCG AATGAAGATT 14 82 0 

TAGTAGTGGA ATCTGGGAAA TTGACTCCCA AAACAAGTTT TCAAATAACC GAGTGGCGCT 14 880 

TAAATAAACA AGGAATTCCA GTATTTAAGC TATCAAATCA TCAATTTATA GCTGCGGACA 14 940 

AACGATTTTT ATATGATCAA TCAGAGGTAA CTCCAACAAT AAAAAAAGTA TGGTTAGAAT 15000 

CTGACTTTAA ACTGTACAAT AGTCCTTATG ATTTAAAAGA AGTGAAATCA TCCTTATCAG 15060 

C TT ATT C GC A AGTATCAATC GACAAGACCA TGTTTGTAGA AGGAAGAGAA TTTCTACATA 1512 0 

TTGATCAGGC TGGATGGGTA GCTAAAGAAT CAACTTCTGA AGAAGATAAT CGGATGAGTA 15180 

AAGTTCAAGA AATGTTATCT G AAAAAT AT C AGAAAGATTC TTTCTCTATT TATGTTAAGC 152 4 0 

AACTGACTAC TGGAAAAGAA GCTGGTATCA ATCAAGATGA AAAGATGTAT GCAGCCAGCG 153 00 

TTTTGAAACT CTCTTATCTC TATTATACGC AAGAAAAAAT AAATGAGGGT CTTT AT C AG T 15 3 60 

TAGATACGAC TGTAAAATAC GTATCTGCAG TCAATGATTT TCCAGGTTCT T AT AAAC C AG 1542 0 

AGGGAAGTGG TAGTCTTCCT AAAAAAGAAG ATAATAAAGA ATATTCTTTA AAGGATTTAA 154 80 

TTACGAAAGT ATCAAAAGAA TCTGATAATG TAGCTCATAA TCTATTGGGA TATTACATTT 15540 

CAAACCAATC TGATGCCACA TTCAAATCCA AGATGTCTGC CATTATGGGA GATGATTGGG 15 6 00 

AT C C AAAAG A AAAATTGATT TCTTCTAAGA TGGCCGGGAA GTTTATGGAA GCTATTTATA 15660 

ATCAAAATGG ATTTGTGCTA GAGTCTTTGA CTAAAACAGA TTTTGATAGT CAGCGAATTG 15720 

CCAAAGGTGT TTCTGTTAAA GTAGCTCATA AAAT TGG AG A TGCGGATGAA TTTAAGCATG 15780 

ATACGGGTGT TGTCTATGCA GATTCTCCAT TTATTCTTTC TATTTTCACT AAGAATTCTG 15840 

ATTATGATAC GATTTCTAAG ATAGCCAAGG ATGTTTATGA GGTTCTAAAA TGAGGGAACC 159 00 

AGATTTTTTA AATCATTTTC TCAAGAAGGG ATATTTCAAA AAGCATGCTA AGGCGGTTCT 15960 

AGCTCTTTCT GGTGGATTAG ATTCCATGTT TCTATTTAAG GTATTGTCTA CTTATCAAAA 16020 

AG AGT TAG AG ATTGAATTGA TTCTAGCTCA TGTGAATCAT AAGCAGAGAA TTGAATCAGA 16080 

TTGGGAAGAA AAGGAATTAA GGAAGTTGGC TGCTGAAGCA GAGCTTCCTA TTTATATCAG 1614 0 

CAATTTTTCA GGAGAATTTT CAGAAGCGCG TGCACGAAAT TTTCGTTATG ATTTTTTTCA 16200 

AGAGGTCATG AAAAAGACAG GTGCGACAGC TTTAGTCACT GCCCACCATG CTGATGATCA 16260 

GGTGGAAACG ATTTTTATGC GCTTGATTCG AGGAACTCGC TTGCGCTATC TATCAGGAAT 16320 

TAAGGAGAAG CAAGTAGTCG GAGAGATAGA AATCATTCGT CCCTTCTTGC ATTTTCAGAA 163 80 
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AAAAGACTTT CCATCAATTT TTCACTTTGA AGATACATCA AATCAGGAGA AT C ATT AT T T 16440 

TCGAAATCGT ATTCGAAATT CTTACTTACC AGAATTGGAA AAAGAAAATC CTCGATTTAG 16 500 

GGATGCAATC TTAGGCATTG GCAATGAAAT TTTAGATTAT GATTTGGCAA TAGCTGAATT 16560 

ATCTAACAAT ATTAATGTGG AAG AT T T AC A GCAGTTATTT TCTTACTCTG AGTCTACACA 16620 

AAGAGTTTTA CTTCAAACTT ATCTGAATCG TTTTCCAGAT TTGAATCTTA CAAAAGCTCA 1668 0 

GTTTGCTGAA GTTCAGCAGA TTTTAAAATC TAAAAGCCAG TATCGTCATC CGATTAAAAA 1674 0 

TGGCTATGAA TTGATAAAAG AGTACCAACA GTTTCAGATT TGTAAAATCA GTCCGCAGgC 16800 

TGATGAAAAG GAAGATGAAC TTGTGTTACA CTATCAAAAT CAGGTAGCTT AT C AAGGAT A 16860 

TTTATTTTCT TTTGGACTTC CATTAGAAGG TGAATTAATT CAACAAATAC CTGTTTCACG 16920 

TGAAACATCC AT AC AC AT T C GTCATCGAAA AACAGGAGAT GTTTTGATTA AAAATGGGCA 16980 

TAGAAAAAAA CTCAGACGTT TATTTATTGA TTTGAAAATC CCTATGGAAA AGAGAAACTC 17 040 

TGCTCTTATT ATTGAGCAAT TTGGTGAAAT TGTCTCAATT TTGGGAATTG CGACCAATAA 17100 

TTTGAGTAAA AAAACGAAAA ATGATATAAT GAACACTGTA CTTTATATAG AAAAAATAGA 17160 

TAGGTAAAAA ATGT T AG AAA AC GAT ATT AA AAAAGTCCTC GTTTCACACG AT G AAATT AC 1722 0 

AGAAGCAGCT AAAAAACTAG GTGCTCAATT AACTAAAGAC TATGCAGGAA AAAATCCAAT 172 80 

CTTAGTTGGG ATTTTAAAAG GATCTATTCC TTTTATGGCT GAATTGGTCA AACATATTGA 17340 

TACACATATT GAAATGGACT TCATGATGGT TTCTAGCTAC CATGGTGGAA CAGCAAGTAG 174 00 

TGGTGTTATC AATATTAAAC AAGATGTGAC T C AAG AT AT C AAAGGAAGAC ATGT T C T AT T 174 60 

TGTAGAAGAT ATCATTGATA CAGGTCAAAC TTTGAAGAAT TTGCGAGATA TGTTTAAAGA 17 520 

AAGAGAAGCA GCTTCTGTTA AAATTGCAAC CTTGTTGGAT AAACCAGAAG GACGTGTTGT 175 80 

AGAAATTGAG G C AGACT AT A CTTGCTTTAC TATCCCAAAT GAGTTTGTAG TAGGTTATGG 17 640 

TTTAGACTAC AAAGAAAATT ATCGTAATCT TCCTTATATT GGAGTATTGA AAGAGGAAGT 17700 

GTATTCAAAT TAGAAAGAAT AATCTTTAAT GAAAAAACAA AATAATGGTT TAATTAAAAA 177 60 

TCCTTTTCTA TGGTTATTAT TTATCTTTTT CCTTGTGACA GGATTCCAGT ATTTCTATTC 17 820 

TGGGAATAAC TCAGGAGGAA GTCAGCAAAT CAACTATACT GAGTTGGTAC AAGAAATTAC 17880 

CGATGGTAAT GTAAAAGAAT T AACTT AC C A ACCAAATGGT AGTGTTATCG AAGTTTCTGG 17 940 

TGTCTATAAA AATCCTAAAA CAAGTAAAGA AGAAACAGGT ATTCAGTTTT TCACGCCATC 18 000 

T GTT ACT AAG GTAGAGAAAT TTACCAGCAC TATTCTTCCT GCAGATACTA CCGTATCAGA 18060 

ATTGCAAAAA CTTGCTACTG AC C AT AAAGC AGAAGTAACT GTTAAGCATG AAAGTTCAAG 18120 
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TGGTATATGG 


ATTAATCTAC 


TCGTATCCAT 


TGTGCCATTT 


GGAATTCTAT 


TCTTCTTCCT 


18180 


ATTCTCTATG 


ATGGGAAATA 


TGGGAGGAGG 


CAATGGCCGT 


AATCCAATGA 


GTTTTGGACG 


18240 


TAGTAAGGCT 


AAAGCAGCAA 


ATAAAGAAGA 


TATTAAAGTA 


AGATTTTCAG 


ATGTTGCTGG 


18300 


AGCTGAGGAA 


GAAAAACAAG 


AACTAGTTGA 


AGTTGTTGAG 


TTCTTAAAAG 


ATCCAAAACG 


18360 


ATT C AC AAAA 


CTTGGAGCCC 


GT ATT C C AG C 


AGGTGTTCTT 


TTGGAGGGAC 


CTCCGGGGAC 


18420 


AGGTAAAACT 


TTGCTTGCTA 


AGGCAGTCGC 


TGGAGAAGCA 


GGTGTTCCAT 


TCTTTAGTAT 


18480 


CTCAGGTTCT 


GACTTTGTAG 


AAATGTTTGT 


CGGAGTTGGA 


GCTAGTCGTG 


TTCGCTCTCT 


18540 


TTTTGAGGAT 


GCCAAAAAAG 


CAGCACCAGC 


TATCATCTTT 


AT C G ATG AAA 


TTGATGCTGT 


18600 


TGGACGTCAA 


CGTGGAGTCG 


GTCTCGGCGG 


AGGTAATGAC 


GAACGTGAAC 


AAACCTTGAA 


18660 


CCAACTTTTG 


ATTGAGATGG 


ATGGTTTTGA 


GGGAAATGAA 


GGGATTATCG 


TCATCGCTGC 


18720 


GACAAACCGT 


TCAGATGTAC 


TTGACCCTGC 


CCTTTTGCGT 


CCAGGACGTT 


T T GAT AG AAA 


18780 


AGTATTGGTT 


GGTCGTCCTG 


ATGTTAAAGG 


TCGTGAAGCA 


ATCTTGAAAG 


TTCACGCTAA 


18840 


GAATAAGCCT 


TTAGCAGAAG 


ATGTTGATTT 


GAAATTAGTG 


GCTCAACAAA 


CTCCAGGCTT 


18900 


TGTTGGTGCT 


GATTTAGAGA 


ATGTCTTGAA 


TGAAGCAGCT 


TTAGTTGCTG 


CTCGTCGCAA 


18960 


TAAATCGATA 


ATTGATGCTT 


C AG AT AT TG A 


TGAAGCAGAA 


GATAGAGTTA 


TTGCTGGACC 


19020 


TTCTAAGAAA 


GATAAGACAG 


TTTCACAAAA 


AGAACGAGAA 


TTGGTTGCTT 


ACCATGAGGC 


19080 


AGG AC AT AC C 


ATTGTTGGTC 


TAGTCTTGTC 


GAATGCTCGC 


GTTGTCCATA 


AGGTTACAAT 


19140 


TGTACCACGC 


GGCCGTGCAG 


G CGG AT AC AT 


GATTGCACTT 


CCTAAAGAGG 


ATCAAATGCT 


19200 


TCTATCTAAA 


GAAGATATGA 


AAGAGCAATT 


GGCTGGCTTA 


ATGGGTGGAC 


GTGTAGCTGA 


19260 


AG AAATT AT C 


TTTAATGTCC 


AAACCACAGG 


AGCTTCAAAC 


GACTTTGAAC 


AAGCGACACA 


19320 


AATGGCACGT 


GCAATGGTTA 


CAGAGTACGG 


TATGAGTGAA 


AAACTTGGCC 


CAGTACAATA 


19380 


TGAAGGAAAC 


CATGCTATGC 


TTGGTGCACA 


GAGTCCTCAA 


AAATCAATTT 


CAGAACAAAC 


19440 


AGCTTATGAA 


ATTGATGAAG 


AGGTTCGTTC 


ATTATTAAAT 


GAGGCACGAA 


ATAAAGCTGC 


19500 


TGAAAT TAT T 


CAGT C AAATC 


GTGAAACTCA 


CAAGTTAATT 


GCAGAAGCAT 


TATTGAAATA 


19560 


CGAAACATTG 


GATAGTACAC 


AAATTAAAGC 


TCTTTACGAA 


ACAGGAAAGA 


TGCCTGAAGC 


19620 


AGTAGAAGAG 


GAATCTCATG 


CACTATCCTA 


TGATGAAGTA 


AAGTCAAAAA 


TGAATGACGA 


19680 


AAAATAACCC 


TGAGAGAGGC 


TGGAGCCTCT 


CTTTTTTGTG 


CAGTTTAGGA 


GCTAAAGGGA 


19740 


ACAGAATGGA 


GAAAATGGAA 


CAAATGTGTT 


TTCTAATCTG 


TTAGACTGTA 


TCTAGAAAGG 


19800 


GGAAAATTAT 


GATTAAAGAA 


TTGTATGAAG 


AAGTCCAAGG 


GACTGTGTAT 


AAGTGTAGAA 


19860 


ATGAATATTA 


CCTTCATTTA 


TGGGAATTGT 


CGGATTGGGA 


GCAAGAAGGC 


ATGCTCTGCT 


19920 
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TACATGAATT 


GATTAGTAGA 


GAAGAAGGAC 


T GGT AG AC G A 


TATTCCACGT 


TTAAGGAAAT 


19980 


ATTTCAAGAC 


CAAGTTTCGA 


AATCGAATTT 


TAGACTATAT 


CCGTAAACAG 


G AAAGT C AG A 


20040 


AG CGT AG AT A 


CGATAAAGAA 


CCCTATGAAG 


AAGTGGGTGA 


GATCAGTCAT 


CGTATAAGTG 


20100 


AGGGGGGTCT 


CTGGCTAGAT 


GATTATTATC 


TCTTTCATGA 


AACACTAAGA 


GAT TAT AG AA 


20160 


ACAAACAAAG 


TAAAGAGAAA 


CAAGAAGAAC 


TAGAACGCGT 


CTTAAGCAAT 


GAACGATTTC 


20220 


GAGGGCGTCA 


AAGAGTATTA 


AG AG ACT T AC 


GCATTGTGTT 


TAAGGAGTTT 


ACTATCCGTA 


20280 


CCCACTAGTA 


AGTCATGCAA 


AAAAAATGAA 


AAAAAT TAG A 


AAAAGTAGTT 


GACAAAGTTT 


20340 


GAAAAGGCTG 


TATAATAGTA 


AGAGTTGAAA 


ATAACAACTC 


AGGTCCGTTG 


GTCAAGGGGT 


20400 


TAAGACACCG 


CCTTTTCACG 


GCGGTAACAC 


GGGTTCGAAT 


CCCGTACGGA 


CTATGGTATG 


20460 


TTGCGTCAGG 


ACCACTTGAT 


GAAAAAAAGT 


TTAAAAAAAC 


TTAAAAATCT 


TCAAAAAAGT 


20520 


GTTGACAAGC 


GAAAGCAGTT 


GT GAT AT AC T 


AATATAGTTG 


TCGCTTGAGA 


GAAGCAAGTG 


20580 


AC AAAG AC C T 


TTGAAAACTG 


AACAAGACGA 


ACCAATGTGC 


AGGGCGCTAC 


AACGTAAGTT 


20640 


GTAGTACTGA 


ACAATGAAAA 


AAACAATAAA 


TCTGTCAGTG 


ACAGAAATGA 


GTAAGAACTC 


20700 


AAACTTTTTA 


ATGAGAGTTT 


GATCCTGGCT 


CAGGACGAAC 


GCTGGCGGCG 


TGCCTAATAC 


20760 


ATGCAAGTAG 


AACGCTGAAG 


GAGGAGCTTG 


CTTCTCTGGA 


TGAGTTGCGA 


ACGGGTGAGT 


20820 


AACGCGTAGG 


TAACCTGCCT 


GGTAGCGGGG 


GATAACTATT 


GGAAACGATA 


GCTAATACCG 


20880 


CATAAGAGTA 


G ATGTT G CAT 


GACATTTGCT 


TAAAAGGTGC 


ACTTGCATCA 


CTACCAGATG 


20940 


GACCTGCGTT 


GTATTAGCTA 


GTTGGTGGGG 


TAACGGCTCA 


CCAAGGCGAC 


GAT AC AT AG C 


21000 


CGACCTGAGA 


GGGTGATCGG 


C C AC AC TGGG 


ACTGAGACAC 


GGCCCAGACT 


CCTACGGGAG 


21060 


GCAGCAGTAG 


GGAATCTTCG 


GCAATGGACG 


GAAGTCTGAC 


CGAGCAACGC 


CGCGTGAGTG 


21120 


AAGAAGGTTT 


TCGGATCGTA 


AAGCTCTGTT 


GTAAGAGAAG 


AACGAGTGTG 


AGAGTGGAAA 


21180 


G T TC AC AC T G 


TGACGGTATC 


TTACCAGAAA 


GGGACGGCTA 


ACTACGTGCC 


AGCAGCCGCG 


21240 


GTAATACGTA 


GGTCCCG AG C 


GTTGTCCGGA 


TTTATTGGGC 


GT AAAGC G AG 


CGCAGGCGGT 


21300 


TAGATAAGTC 


TGAAGTTAAA 


GGCTGTGGCT 


TAACCATA 






21338 


(2) INFORMATION FOR SEQ ID NO: 21: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6273 base pairs 
<B) TYPE: nucleic acid 
(C> STRANDEDNESS: double 
(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
TGTTTTTAAA GAGCCGTGTC TGGATAGACT TTCGGACGCA ACGCTCTATT AGATAATGAA 60 
CTGCCTATAC ACAAGATTTC TAACCTTAGT CGACATGAGC TGAAACCTCT TATTTGTTAA 12 0 

GTAGTTCACA AAATATTATA CACCTATTTT ATGAATAGTC AACTGTCTTT ACAGTAAAAT 180 
TTTAGAAAAT CATGAAAATT TTCTCTTTCT TTCCATTTTA AGTGACATTC AGTCATTCTC 240 
ACATCAAAAA AGCCCAGACG AAATTGTCTG AGCATTCTTT TATCTAGTCG TTTAAGGAAG 3 00 

TTGAGTTCAG TATGTTTAAA GTCTCTGTCC CATCATTTCT TCAACAAACC TTGTTCTTGG 3 60 

AG AAACT C CT TGGCTACTTG CTTTGCTGAC TTGCCTTCAA CACCGACTTG GTAGTTGAGC 42 0 

TGGCTCATCT GGCTTTCTGT AATCTTACCA GCCAATGTAT TAAGAACTCT TTCCAACTCT 480 
GGGTGTTTCT TGAGAAGAGC TTCTTTCATG AGTGGAGCCC CTTGATAAGG TGGGAAGAGT 540 
TGCTTGTCAT CTTCCAAGAC CTGTAAATCA TAACGCTCCA ATTCCGCATC AGTCGAATAG 6 00 

GCATCCGTGA TTTGAATATC CCCTGACTGA AT AG C CTG AT AGCGAAGGGC TGGCTCAATG 660 
GT CG CT AC AT TGAGATTGAG ACCATACATT GATTGCAAGC CCTTATTTCC ATCTTCACGG 72 0 

TCGTTAAACT CGAGTGTAAA ACCTGCCTTC AACTGCCCTT CCACTTTTTT CAAGTCTGAA 780 
ATGGTCTTCA AGCC AT AT T C TTGAGCAATC TTTTTCGGAA CAGCTACAGC ATAGGTGTTT 840 
T G AT AAG AC A TGGGTTTGAG ATAGGCTAGA TGATCCTGCT TAGCAATGCC ATCACGCGCC 900 
AC CTG AT AAA CCTGTTCTGG TTCATGACTC ACCTTGGGTG ATGGTTGAAG CAAACTTTCA 96 0 

GTCACCGTAC CAGTAAATTC AGGATAGATG TCAATATCGC CTTTTTTCAG AGCTTCATAA 102 0 

AGGAAGCTTG TCTTCCCAAA ATTCGGTTTA ACAGTCGCAG TCATGCTGGT ATTTTCTTCA 1080 

ATCAGCAACT TATACATATT GGCCAAAATT TCTGGTTCTG G AC CT AT T T T CCCAGCAATA 114 0 

ACCAAGTTTT CCTTCTCTTT TTG AAC C AAA AGAGCTGGAC TATAAGACAG ACCCAGTAAT 12 00 

AAAGCCACCA AGGCAAAACC TGAGAAAAT C GTCCGTAATT TTGCTTTTTC CATCACTTTT 12 6 0 

AGTAGGAAGT TAAAGGCAAT GGCTAGCACT G C AG AAG AAA GTGCCCCAAT CAAAATCAAA 13 2 0 

CTGGCATTAT TACGGTCAAT TCCCAAAAGA ATAAAGGAAC CTAGTCCCCC TGCACCAATC 13 80 

AAGGCCGCCA AGGTTGCCGT ACCGATAATC AAAAC AG CTG CCGTCCGAAT CCCAGACATG 1440 

ATAACAGGCA TGGCGAGTGG AATTTCAAAT TTCTTGAGAC GTTCCCATCT GGTCATCCCA 150 0 

AAGGCAATCC CAGCCTCTTG CAGGTTCGGA TCAATTCCCT TCAGCCCAGT GATAGTATTT 15 60 

TGCAAAATAG GGAAAATCGC ATAAATCACT AG AG CTGT C A AAGCCGGCAA GGTCCCAATT 162 0 

CCCATCAAAG GGATAAAGAG CCCCAACAAG GCCAGAGACG GGATGGTCTG GAAAATACCT 16 8 0 

GCAATCTGCA AGACCCAGTC GGCCAGCTTC TCATGATAGC GAAGAAAAAC AGCCAAGGGA 1740 
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ATCGCAAGCA 


AAAT AG C TAG 


TAACAAGGTC 


AAAAGCGACA 


ACTGCAAATG 


TTGAGATAGA 


1800 


GCTGTCAACC 


AAT C AC T AAA 


ACGATCCTGA 


AAAGTTGCAA 


TTAAATTAGT 


CATGAACACT 


1860 


ACCTCCAAAC 


AAGTCTGCTA 


CAAAGTCTGT 


TGCAGGCGCT 


TTTAAAATTG 


TCTCGGGATT 


1920 


CGCTACCTGG 


CGAATTTCTC 


CATCCTGCAA 


G AC AG C AAT A 


CGGTCCGCCA 


ACTTCAAGGC 


1980 


TTCATCCGTA 


TCATGGGTTA 


CAAAAATCGT 


TGTCATCCCA 


AACTCTTTAT 


GCAATTCTTT 


2040 


TGTCAGAACC 


TGCAACTGTT 


TTCTCGAAAT 


AGCATCCAAG 


GCCGAAAAGG 


GTTCATCCAT 


2100 


GAGGAAAATC 


TTGGGCTGAC 


CAATCATAGC 


TCGGACAATA 


CCGACCCGTT 


GCTGTTCTCC 


2160 


AC C AG AT AAT 


TCACTAGGTA 


AGCGATGCCC 


ATACTCGGCT 


ACTGGTAAAC 


CAACCTTAGC 


2220 


CAAAAGCTCT 


TCTGTTTTCT 


TCGTAATTTC 


TTCCTTGCTC 


CACCCCTTCA 


TTTCAGGAAT 


2280 


GAGAGCAATA 


TTTTCCGCAA 


CTGTTAGATT 


TGGAAAAAGA 


G C AAT AG C C T 


GTAAAACATA 


2340 


ACCAGTAGAA 


AGACGAAGTT 


CACGCTCATC 


ATAGTCTTTG 


ATGCGCTTCC 


CATCCATATA 


2400 


AAT AT T TC C A 


TCAGTTGGTT 


C C AAAAGAC G 


GTTAATCATC 


TTGAGCATGG 


TCGTCTTACC 


2460 


TGACCCAGAA 


GGCCCTACTA 


AAAC CAT AAA 


TTCCCCATCC 


TCAATCTGTA 


AGTTGACATC 


2520 


TCTCAAGACA 


TCCTTTTCTG 


TGTAGCGCAG 


TGCTACATTT 


TTGTATTCAA 


TCATTCTTTG 


2580 


TCCTCAATTT 


AAAACTTCCC 


TCGATTGGTC 


AAGTCTTCTA 


CCTTAGGCAT 


AACTTCCTTA 


2640 


TTATCCCAAT 


GCTCCACAAT 


TTTCCCGTTC 


TCTAAACGGA 


AGATATCGTA 


CTGGGCATAA 


2700 


GCAACGCCAT 


CAATCTGAGT 


CTG AC CAT AG 


CTAACCACAT 


AGTTTCCTTG 


TCCTAAGAGT 


2760 


TGGAAAACAA 


AGTCAAAAGT 


G AC AC TAT AT 


TCAGCCACAT 


AGTTTTTATA 


AGCAGCACTT 


2820 


CCTTGTCCAA 


TATCATGATT 


ATGCTGAATC 


AAATCGTCTG 


C C AC AT AAT C 


ACTCCACTGC 


2880 


TCTAGCTCCC 


CATTTTGGAA 


AATTTCTGTC 


AAGAAACGGC 


GAACCAGCTT 


TTTATTTTCT 


2940 


GCTTTCTTAT 


CCAAATCCTT 


GATTTCAAAA 


TCTCCAAAAA 


TTTGATCTAG 


TTGGTCATTT 


3000 


TCAGGTGTTC 


GATAGTAGTC 


AATGACATCC 


CAATGCTCAA 


CAATACAACC 


ATTCTCATCC 


3060 


TCACGGAAAG 


TATCCGTCGT 


CACCCATTGA 


GCTTCTCCAC 


CATTCAGATA 


TTGATGAACA 


3120 


TGAACAAAGA 


CCAGATTGCC 


ATCCTCAATG 


GTGCGGACAA 


TCTTAATCTG 


ACGCTCTGGA 


3180 


TGACGCTCAA 


AGAAATCTGC 


AAAGAAGGCT 


GCAAATCCTT 


CTTTCCCGTC 


AGGAACACCT 


3240 


GTCGAATGTT 


GGATATAGGT 


ATCCCCTACA 


GACTGGGCTT 


GAGCCTCAGC 


AACTCGTCCG 


3300 


TCTTGAATGG 


CATGGATGTA 


TAGGTTGTGA 


GCATTTTTCA 


CTTGTTGTGA 


CATATTCTAA 


3360 


ACCTCATTTC 


CCTTCTCTTT 


CAGATTCGCC 


AAAATTCTTT 


CTTGAAAACC 


TTCAAATTGG 


3420 


TGAATTTCTT 


CCTCTGAAAA 


TCCTTTGTAA 


AAGATAGTAT 


CCAATTTCTG 


ACTG AC AC G A 


3480 
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TGCCCCACTT CTTTCTGGGA CTTGCCTAAC TCCGTTAAAA CTAAATACTT CTTACGCTTG 3 54 0 

TCTTTTCCAC ACGGACTAAC AATTACAAGC TTTTGTTCCT CTAGCTTTTT TATCATAGTC 3 600 

GTCAGCGTAT TATTCGCAAG TCCAGTCGCA AGCGCGATAT CTGTCGCAGT TGCGCAGCCA 3 660 

GTTTCACTAT TCCATAAAAC CGCTAAAATC TTGCCCTGTT CACCCCTATA AAGAGCCTCA 3720 

GGATCTTGAC TCAGTAACTT TTGAAAAATC CGCCCATTCA ACAAACGAAT ATGATGGGCT 3 7 80 

AGCAAATGAC CATCTTTCAT AACACCTCCA ATTTATTTCG ATATCGAAAT GAATAAAACA 3 84 0 

ATTGTAACAC TCATCGTTCT AACTGTCAAC TATTTCGATT TAGAAATAAT TTTTGATAAT 3 900 

TATCCACACC ACCATACTCC GGCTCAACTA ACTTTTAACG AGAGTTTCTA AACTCCTTCG 3 960 

TCCTCCAGTC TACAAAAGCC TTCCATTCGT ACTATCCTAT ATTTTATGAG GGGACACATT 402 0 

TTTCCTATCA GACCATTTAT TTTAAAGATA GAAGTAAATC ATAATTGCTT CCATCTGTTC 4 080 

TTTTATAGTA TATTGAAGTT AG ACT AG AG C ACTGTATCTT CTAAAACATT G AT AG AAAG C 414 0 

GATTTGAATT TCCCAATCAA TTTGTTCGTA TTTATAGCAT TTCGAAACTG GAATAGGACA 4200 

CCATGACTGC TAAAAGATTT CTATAAATTC ATTTAATTTC CTCAATCAAT TTGTTCATAT 42 60 

CTTATTTCAT TCCGCTATAA TTTCACCTTA CCCTATCTTT TTCGTAGCAC CCTTCAAACA 4 32 0 

GCCTATCCCC TACCGTTTGA CGATTCCTCA CTTCGCTCCA CTTCCATTAC AGAAGTTTCT 4 3 80 

TC AC TACT AT GGGCTCGGCT GACTTCTCAT GATTCCTTGT TACTACTATT TGAACGCTCA 444 0 

C GAG AT AG AT CTTACAAAAA ATGCTTTGAT CCACAATGGA ATCAAAGCAT T T T AAAG AG T 4 500 

TCCTCATACA TAAGCGCAGA AGTCGCAGTT CCTCTGTACT TGGCTTCTTC TCTTTTGACA 4 5 60 

AAGCGAGCCA AGTTGAGCAA CTCAGGTGCT GGATGTTTGG GATTTAGGAG CAATTCACGA 4 62 0 

TTGACCAGGC CTGAGAGACG AACTGCCTGC AATTGCTCAT TTGTAGTAGG CAGTTTTTTA 4 680 

GTAGTCTCTA GGAGAGCAGC AACTAAATCT TCACTCAAAT CATGTCGAGC ATGATTGTAA 4740 

AGATCTTTTA TAAGGCTTTC TAGGTTTGGT TCTACCATCC CTACCACCTC CCTTATGGTT 4 800 

TAATAATGTT TAATCAAATC AACCGTTGAA CGATCCAATT TCTTCACCAA GGCTTGTAAG 4 860 

AAAGCTTGCG CTTCTAGGAA GTCATCCATT GCATAGAGGG TTTGGTGAGA ATGGATATAA 4 92 0 

CGAGCGCAGA CACCGATAGT TGTTGATGGG ACACCACCAT T T TT C AG AT G AGCTGCACCT 4 980 

GCATCTGTTC CGCCTTTACC ACAGTAGTAT TGGTACTTGA TACCAGCTTC TTCAGCCGTT 504 0 

GTCAAAAGGA AATCCTTCAT CCCTGGGAGA AGCAAGTGAC CTGGATCATA GAAACGAATC 5100 

AAGGTTCCAT CTCCAATCTT GCCT TGACC A CCGTAGACAT CACCTGCTGG TGAGCAATCA 5160 

ACTGCGAGGA AGACTTCTGG GTCAAACTTG GTTGTAGAGG TATGAGCGCC ACGCAGACCA 5220 

ACTTCTTCTT GGACGTTAGA ACCCAGATAG AGTTCATTGC CGAGTTTTTG ACCCGATAAA 52 80 
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GCTTCAGCTA GCTCGCTTAC CATGAGGACA CCGTAGCGGT TATCCCAAGC TT T TG AG AT G 5340 

ATATTTTTTT CATTGGCTGT CAAAATTGCA GAACTATCTG GTACAATGGT AT C AC C AGG A 5400 

CGGATGCCAA AACTTTCTGC CTCAGCCTTG TCCGCAAAAC CACCATCAAA AACGATATCG 5460 

GCAATGGCTG GCATGGTTGG TCCCCCCTTT CCACGAGTCA AATGCGGAGG AACAGAACCT 5520 

G AAAT C AC AG GAATTTCATG AC CAT C ACG A GTCAAGAGTT TGAAACGTTG GCTGCTAACC 55 80 

ACCATGGGGT TCCAGCCACC GATTTCTACG ACACGGAAGG TACCATCTGG CTTGATTTCG 5640 

CTGACCATAA AACCAACTTC GTCCATATGA GAAGCGACCA AGACGCGCGG TGCATCCACA 5700 

GCTTCTGAAT GTTTGATACC AAAAAT AC C A CCCAAGCCAT CTGTCACCAC TTCATCCACA 57 60 

TGCGGTGTCA ACTTTTCACG AAGATAAGCA CGGACAGGCG CTTCATGACC TGAGACTGCA 582 0 

GCAAGTTCTG TTACTTCTTT AATTTTTGAA AATAATGTTG TCATTTCAGT TCCTTCTTTC 5880 

TTTCATCCAT TT T AC CAC T T T TT AT AGG AG AAGGATAGTG GGAAGGTGGA TTTCTAAGTT 5 940 

AGT AT C T TAG TCCTGCTCTA TCTTAGAAAA GGATAGTATT CTCTTGCATG TAGTGCAAAA 600 0 

TCTAGTAAAC ATTCCAAAAT TAACTCGAAT ATTTATTTCC AAACAAAAAA AC AAT AC AC C 60 60 

ATCAAAGTTG TTTGGATTTT TCATGAAATT TACAGAAAAT AGTTGACTTC CCTTTCTTCT 612 0 

TTCTTTAAAT ATATAGTTGG TTGAGTTTGG AATAGTACGC TGTAGCTGCT AAAACATTTC 618 0 

TAGAAATTAA TTTGACTTTC CTAATAGAGT TGTTCATATC TTATTTCAAT TTACTATAGT 62 40 

ACAAAACTAG AAAAGGAAAA AATCATGACC AGG 62 7 3 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28171 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 22: 

ACAACCTTTT TCAAAAACTC ACCTTGGTAC GGAGATGTTT TGCTTTCTGC TATTATTTTC 60 

GGTTATATTC ATATCAATTT TGCTTTAACT CCTCTTGCTT TTTTCATTTA TGCTAGTGGA 12 0 

GGTCTTATTT TAGCTCTATT GTATCGCATG AC T AAAAAT C TCTACTATCC AATACTAGTT 180 

CATATTCTCA TT AATAT CAC TGCCTTCTGG GATGTGTGGT TGCTCCTATT TTCAGGAAGT 24 0 

TAGCTTACTA AAATAATGTC GGAACTTTCC GGCATTTTCT TTTTTCACAA ATAGTCAACG 300 

TTTTTCTTTT CGATATTGTA GTGGTGTGTA TCCAGTTATT TTTTTGAATT GATTTTGAAA 3 60 
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AT AAGGT T G A CTTGAGAAAG GCAGATAGTG AAGATAGTTA AGAAGAATAG GATGTTCTTT 42 0 

TTTCCTTTTT GGAAAACTTC TAAAATATGG T AT AAT G AAA AGATAAAGAA GTTGGGGGTA 480 

GAAGATGAAC AT TC AAC AAT TACGCTATGT TGTGGCTATT GCCAATAGTG GTACTTTTCG 540 

TGAAGCTGCT GAAAAGATGT ATGTTAGTCA GCCGAGTCTG TCTATTTCTG TTCGTGATTT 600 

GGAAAAAGAG TTGGGCTTTA AGATTTTCCG TCGGACCAGC TCAGGGACTT TCTTGACCCG 660 

TCGTGGGATG GAATTTTATG AAAAATCGCA AGAATTGGTT AAAGGATTTG ATATTTTTCA 72 0 

AAATCAGTAT GCCAATCCTG AAGAAGAAAA AGATGAATTT TCTGTTGCTA GC C AG C ACT A 780 

TGACTTCTTG CCACCAACTA TTACGGCCTT TTCAGAGCGC TATCCTGACT ATAAGAACTT 84 0 

CCGTATTTTT GAATCAACTA CTGTTCAAAT ATTAGATGAA GTGGCGCAAG GGCATAGTGA 900 

GATTGGGATT ATCTACCTCA ACAATCAAAA TAAAAAGGGG ATTATGCAAC GGGTTGAAAA 960 

ATTAGGTCTG GAGGTCATCG AATTGATTCC TTTCCATACC CATATTTATC TCCGTGAGGG 102 0 

TCATCCTTTA GCCCAGAAAG AGGAATTAGT CATGGAGGAT TTAGCGGATT TACCAACGGT 10 80 

TCGTTTCACT CAAGAGAAAG ACGAGTACCT TTATTATTCA GAGAACTTTG T CG AT AC C AG 114 0 

CGCTAGCTCA CAGATGTTTA ATGTGACAGA CCGTGCCACC TTGAATGGTA TTTTGGAGCG 12 00 

GACGGACGCC T ATGC G AC AG GTTCTGGATT TTTAGATAGT GACAGTGTTA ATGGCATTAC 12 60 

AGTTATTCGT CTCAAGGATA ACCTAGATAA CCGCATGGTC TATGTTAAAC GTGAAGAAGT 132 0 

GGAGCTTAGT CAAGCTGGGA CTCTCTTCGT AGAAGTCATG CAAGAATATT TTGATCAAAA 13 8 0 

GAGGAAATCA TGAAAAAAAG AGCAATAGTG GCAGTCATTG TACTGCTTTT GATTGGGCTG 1440 

GATCAGTTGG TCAAATCCTA TATCGTCCAG CAGATTCCAC TGGGTGAAGT GCGCTCCTGG 1500 

ATCCCCAATT TCGTTAGCTT GACCTACCTG CAAAATCGAG GTGCAGCCTT TTCTATCTTA 15 60 

CAAGATCAGC AGCTGTTATT CGCTGTCATT ACTCTGGTTG TCGTGATAGG TGCCATTTGG 162 0 

TATTTACATA AACACATGGA GG ACT C ATT C TGGATGGTCT TGGGTTTGAC T C T AAT AAT C 1680 

GCGGGTGGTC TTGGAAACTT TATTGACAGG GTCAGTCAGG GCTTTGTTGT GGATATGTTC 174 0 

CACCTTGACT TTATCAACTT TGCAATTTTC AATGTGGCAG AT AGCT AT C T GACGGTTGGA 1800 

GTGATTATTT TATTGATTGC AATGCTAAAA GAGGAAATAA AT GG AAATT A AAATTGAAAC 1860 

TGGTGGTCTG CGTTTGGATA AGGCTTTGTC AGATTTGTCA GAATTATCAC GTAGTCTCGC 1920 

GAATGAACAA ATTAAATCAG GCCAGGTCTT GGTCAATGGT CAAGTCAAGA AAGCTAAATA 1980 

CACAGTCCAA GAGGGTGATG TCGTCACTTA CCATGTGCCA GAACCAGAGG TATTAGAGTA 2040 

TGTGGCTGAG GATCTTCCGC TAGAAATAGT CTACCAAGAT GAGGATGTGG CTGTCGTTAA 2100 

CAAACCTCAG GGAATGGTTG TGCACCCGAG TGCTGGTCAT ACCAGTGGAA CCCTAGTAAA 2160 
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TGCCCTCATG TAT CAT ATT A AGG ACT TGT C GGGTATCAAT GGGGTTCTGC GTCCAGGGAT 222 0 

TGTTCACCGT ATTGATAAGG ATACGTCAGG TCTTCTCATG ATTGCTAAAA ACGATGATGC 22 80 

GCATCTAGCA CTTGCCCAAG AACTCAAGGA TAAAAAGTCT CTCCGCAAAT ATTGGGCGAT 2 340 

TGTTCATGGA AATCTACCTA AT G ATCGTGG TGTAATTGAA GCGCCGATTG GCCGGAGTGA 24 00 

AAAAGACCGT AAGAAACAGG CTGTAACTGC TAAAGGGAAG CCTGCAGTGA CGCGTTTTCA 2 4 60 

CGTCTTGGAA CGCTTTGGCG ATTATAGCTT AGTAGAGTTG CAACTGGAGA CAGGGCGCAC 2 52 0 

TCATCAAATC CGTGTCCACA TGGCTTATAT CGGCCATCCA GTCGCTGGTG ATGAGGTCTA 2580 

TGGTCCTCGC AAGACTTTGA AAGG AC AT GG ACAATTTCTT CATGCCAAGA CTTTAGGTTT 2 640 

TACT CAT CCG AGAACAGGTA AGACCTTGGA ATTTAAAGCA GAT AT C C C AG AGATTTTTAA 27 00 

GGAAACCTTG GAGAGATTGA GAAAGTAAGA ATGAAAAAGA AATTAACTAG TTTAGCACTT 27 60 

GTAGGCGCTT TTTTAGGTTT GTCATGGTAT GGGAATGTTC AGGCTCAAGA AAGTTCAGGA 2 820 

AATAAAATCC ACTTTATCAA TGTTCAAGAA GGTGGCAGTG ATGCGATTAT TCTTGAAAGC 2 880 

AATGG AC AT T TTGCCATGGT GGATACAGGA GAAGATTATG ATTTCCCAGA TGGAAGTGAT 2 94 0 

TCTCGCTATC CATGGAGAGA AGGAATTGAA ACGTCTTATA AGCATGTTCT AACAGACCGT 3 000 

GTCTTTCGTC GTTTGAAGGA ATTGGGTGTC CAAAAACTTG ATTTTATTTT GGTGACCCAT 3060 

ACCCACAGTG ATCATATTGG AAATGTTGAT GAATTACTGT CTACCTATCC AGTTGACCGA 312 0 

GTCTATCTTA AGAAATATAG TGATAGTCGT ATTACTAATT CTGAACGTCT ATGGGATAAT 3180 

CTGTATGGCT ATGATAAGGT TTTACAGACT GCTGCAGAAA AAGGTGTTTC AGTTATTCAA 324 0 

AAT AT C AC AC AAGGGGATGC TCATTTTCAG TTTGGGGACA TGGATATTCA GCTCTATAAT 3 3 00 

TATGAAAATG AAACTGATTC ATCGGGTGAA TTAAAGAAAA TTTGGGATGA CAATTCCAAT 33 60 

TCCTTGATTA GCGTGGTGAA AGTC AATGG C AAGAAAATTT ACCTTGGGGG CGATTTAGAT 342 0 

AATGTTCATG GAGCAGAAGA CAAGTATGGT CCTCTCATTG GAAAAGTTGA TTTGATGAAG 3 4 80 

TT T AAT CAT C AC C ATG AT AC CAACAAATCA AATACCAAGG ATTTCATTAA AAATTTGAGT 3 540 

CCGAGTTTGA TTGTTCAAAC TTCGGATAGT CTACCTTGGA AAAATGGTGT TGATAGTGAG 3 600 

TATGTTAATT GGCTCAAAGA ACGAGGAATT GAGAGAATCA ACGCAGCCAG CAAAGACTAT 3 6 60 

GATGCAACAG TTTTTGATAT TCGAAAAGAC GGTTTTGTCA ATATTTCAAC ATCCTACAAG 372 0 

CCGATTCCAA GTTTTCAAGC TGGTTGGCAT AAGAGTGCAT ATGGGAACTG GTGGTATCAA 3780 

GCGCCTGATT CTACAGGAGA GTATGCTGTC GGTTGGAATG AAATCGAAGG TGAATGGTAT 3840 

TACTTTAACC AAACGGGTAT CTTGTTACAG AAT C AAT GG A AAAAATGGAA CAATCATTGG 3900 
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TTCTATTTGA 


CAGACTCTGG 


TGCTTCTGCT 


AAAAAT T GG A 


AGAAAATCGC 


TGGAATCTGG 


3960 


TATTATTTTA 


ACAAAGAAAA 


CCAGATGGAA 


ATTGGTTGGA 


TTCAAGATAA 


AGAGCAGTGG 


4020 


TATTATTTGG 


ATGTTGATGG 


TTCTATGAAG 


ACAGGATGGC 


TTCAATATAT 


GGGGCAATGG 


4080 


TATTACTTTG 


CTCCATCAGG 


GG AAAT G AAA 


ATGGGCTGGG 


TAAAAGATAA 


AGAAACCTGG 


4140 


TACTATATGG 


ATTCTACTGG 


TGTCATGAAG 


ACAGGTGAGA 


T AG AAGTTG C 


TGGTCAACAT 


4200 


TATTATCTGG 


AAGATTCAGG 


AGCTATGAAG 


CAAGGCTGGC 


ATAAAAAGGC 


AAATGATTGG 


4260 


TATTTCTACA 


AGACAGACGG 


TTCACGAGCT 


GTGGGTTGGA 


TCAAGGACAA 


GGATAAATGG 


4320 


TACTTCTTGA 


AAGAAAATGG 


TCAATTACTT 


GTGAACGGTA 


AGACACCAGA 


AGGTTATACT 


4380 


GTGGATTCAA 


GTGGTGCCTG 


GTTAGTGGAT 


GTTTCGATCG 


AGAAATCTGC 


TACAATTAAA 


4440 


ACTACAAGTC 


ATT C AG AAAT 


AAAAGAATCC 


AAAGAAGTAG 


T G AAAAAGG A 


TCTTGAAAAT 


4500 


AAAGAAACGA 


GTCAACATGA 


AAGTGTTACA 


AATTTTTCAA 


CTAGTCAAGA 


TTTGACATCC 


4560 


TCAACTTCAC 


AAAGCTCTGA 


AACGAGTGTA 


AACAAATCGG 


AATCAGAACA 


GTAGTAGAAA 


4620 


AGAAGGTTTT 


AGGGCCTTCT 


TTTTCCTATC 


AACTCTTTTC 


TATTTCCTGT 


TATTCATGTT 


4680 


ATAATGGATA 


AATATGAATA 


ATCGGAGTGA 


GACTATGAAA 


TACAAACGGA 


TTGTCTTTAA 


4740 


GGTGGGTACT 


TCTTCTCTGA 


CAAATGAGGA 


TGGAAGTTTA 


TCACGTAGTA 


AGGTAAAGGA 


4800 


TATTACCCAG 


CAGTTGGCTA 


TGCTGCACGA 


GGCTGGTCAT 


GAGTTGATTT 


TGGTGTCTTC 


4860 


AGGTGCCATT 


GCGGCTGGTT 


TTGGAGCCTT 


AGGATTTAAA 


AAGCGTCCGA 


CTAAGATTGC 


4920 


TGATAAACAG 


GCTTCAGCAG 


CGGTAGGGCA 


GGGGCTTTTG 


TTGGAAGAAT 


ATACAACCAA 


4980 


TCTTCTCTTG 


CGTCAAATCG 


TTTCTGCACA 


AATCTTGCTG 


AC C CAAG AT G 


ACTTTGTGGA 


5040 


TAAGCGTCGT 


TATAAAAATG 


CCCATCAGGC 


TTTGTCGGTT 


TTGCTCAACC 


GTGGGGCAAT 


5100 


TC C TAT CATC 


AATGAGAATG 


ATAGTGTCGT 


TATTGATGAG 


CTCAAGGTTG 


GGGACAATGA 


5160 


CACTCTAAGT 


GCTCAAGTAG 


CGGCGATGGT 


C C AAGC AG AC 


CTTTTAGTTT 


TCTTGACAGA 


5220 


TGTGGACGGT 


CTCTATACTG 


GAAATCCTAA 


TTCAGATCCA 


AGAGCCAAAC 


GCTTGGAGAG 


5280 


AATCGAGACC 


ATCAATCGTG 


AG ATT AT TG A 


TATGGCTGGT 


GGAGCTGGTT 


CGTCAAACGG 


5340 


AACTGGGGGT 


ATGTTAACCA 


AAATCAAGGC 


TGCAACTATC 


GCGACGGAAT 


CAGGAGTTCC 


5400 


TGTTTATATC 


TGCTCATCCT 


TG AAAT C AGA 


TTCCATGATT 


GAGGCGGCAG 


AGGAGACCGA 


5460 


GGATGGTTCT 


TACTTTGTTG 


CTCAAGAGAA 


GGGGCTTCGT 


ACCCAGAAAC 


AATGGCTTGC 


5520 


CTTCTATGCT 


CAGAGTCAAG 


GTTCTATTTG 


GGTTGATAAA 


GGGGCTGCGG 


AAGCTCTCTC 


5580 


TCAATATGGA 


AAGAGTCTTC 


TCTTATCTGG 


TATCGTTGAA 


GCAGAAGGAG 


TCTTTTCTTA 


5640 


CGGTGATATC 


GTGACAGTAT 


TTGACAAGGA 


AAGTGGAAAA 


TCACTTGGAA 


AAGGACGCGT 


5700 
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GCAATTTGGA GCATCTGCTT TGGAGGATAT GTTGCGTTCT CAAAAAGCCA AGGGTGTCTT 57 60 

GATTTACCGT GACGACTGGA TTTCCATTAC TCCTGAAATC CAACTACTTT TTACAGAATT 582 0 

TTAGAGGTAA ACTATGGTGA GTAGACAAGA ACAATTTGAA CAGGTACAGG CTGTTAAAAA 5 880 

ATCGATTAAC ACAGCTAGTG AAGAAGTGAA AAACCAAGCC TTGCTAGCCA TGGCTGATCA 5940 

CTTAGTGGCT GCTACTGAGG AAATTTTAGC GGCTAATGCC CTCGATATGG CAGCGGCTAA 6000 

GGGGAAAATC TCAGATGTGA TGTTGGATCG TCTTTATTTG GATGCAGATC GTATAGAAGC 6060 

GATGGCAAGA GGAATTCGTG AAGTGGTTGC CTTACCAGAT CCAATCGGTG AAGTTTTAGA 6120 

AACAAGTCAG CTTGAAAATG GTTTGGTTAT CACAAAAAAA CGTGTAGCTA TGGGTGTCAT 6180 

CGGTATTATC TAT G AAAGCC GTCCAAATGT GACGTCTGAT GCGGCTGCTT TGACTCTTAA 62 4 0 

GAGTGGAAAT GCGGTTGTTC TTCGTAGTGG TAAGGATGCC TATCAAACAA CCCATGCCAT 63 00 

TGTCACAGCC TTGAAGAAGG GCTTGGAGAC G ACT AC TAT T CATCCAAATG TGATTCAACT 63 60 

GGTGGAGGAT ACTAGCCGTG AAAGTAGTTA TGCTATGATG AAGGCCAAGG GCTATCTAGA 642 0 

CCTTCTCATT CCTCGTGGAG GAGCTGGCTT GATCAATGCA GTGGTTGAGA ATGCGATTGT 6480 

ACCTGTTATC GAGACAGGGA CTGGGATTGT CCATGTCTAT GTGGATAAGG ATGC AG AC G A 6540 

AG AC AAGG CG CTGTCTATCA TCAACAATGC TAAAACCAGT CGTCCTTCTG TTTGTAATGC 6600 

CATGGAGGTT CTGCTGGTTC ATGAAAACAA GGCAGCAAGC TTCCTTCCTC GCTTGGAGCA 6 6 60 

AGTGTTGGTT GCAGAGCGTA AGGAAGCTGG ACTGGAACCA ATTCAATTCC GCCTAGATAG 67 2 0 

CAAAGCAAGC CAGTTTGTTT CAGGTCAAGC AG CTG AG AC C CAAGACTTTG ACACCGAGTT 67 8 0 

TT TAG ACT AT GTCCTTGCTG TTAAGGTTGT GAGCAGTTTA GAAGAAGCGG TTGCGCACAT 684 0 

TGAATCCCAC AGCACCCATC ATTCGGATGC TATTGTGACG GAAAATGCTG AAGCTGCAGC 6 9 00 

ATACTTTACA GATCAAGTGG ACTCTGCAGC GGTGTATGTT AATGCCTCAA CTCGTTTCAC 69 60 

AGATGGAGGA CAATTTGGTC TTGGTTGTGA AATGGGGATT TCTACTCAGA AATTGCACGC 7 02 0 

GCGTGGTCCC ATGGGCTTGA AAGAGTTGAC CAGCTACAAG TATGTGGTTG CCGGTGATGG 708 0 

GCAGATAAGG GAGTAAGAGA TGAAGATTGG ATTTATCGGT TTGGGGAATA TGGGTGCTAG 7140 

CTTGGCAAAA TCTGTCTTGC AGACTAGGAC GT C AG AT GAG ATTCTCCTTG CCAATCGTAG 72 00 

TCAAGCTAAG GTAGATGCTT TCATTGCAGA CTTTGGTGGT CAGGCTTCCA GCAATGAAGA 72 60 

AATGTTTGCA GAAGC AG AT G TGATTTTTCT AGGAGTTAAG CCTGCTCAGT TTTCTGAACT 7320 

GCTTTCTCAA TACCAGACCA TCCTTGAAAA AAGAGAAAGT CTTCTTTTGA TTTCGATGGC 7 3 80 

AGCTGGATTG ACCTTAGAAA AACTAGCAAG TCTTATCCCA AGTCAACACC GAATTATTCG 744 0 
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TATGATGCCT AATACCCCTG CTTCTATCGG GCAAGGAGTG ATTAGTTATG CCTTGTCTCC 7 500 

TAATTGCAGG GCTGAGGACA GTGAGCTCTT TTATCAGCTT TTAGCCAAGG CTGGTCTCTT 75 60 

GGTTGAACTA GGAGAAAGTT TAATCGATGC AG CG AC AGGT CTTGCAGGTT GTGGACCAGC 7 62 0 

CTTTGTCTAT CTTTTTATCG AGGCCTTGGC AGATGCAGGT GTTCAGACAG GATTACCACG 7 6 80 

AGAAAT AG C A TTGAAAATGG CAGCACAAAC TGTGGTAGGA GCTGGGCAAT TGGTCCTTGA 774 0 

AAGTCAGCAA CATCCTGGAG TATTGAAAGA CCAAGTCTGT AGCCCAGGCG GT T CG ACT AT 7 800 

CGCTGGTGTA GCAAGCCTAG AAGCGCATGC TTTCCGAGGA ACAGTCATGG ATGCAGTTCA 7 8 60 

TCAAGCCTAC AAACGAACAC AAGAACTAGG TAAATAAGAG GTAGTTTTGA CTGCCTCTTT 7 92 0 

TATGGTGGCT GAAATGAGAA GACACAAAAA GATTGTCACA AACCCCTATT T T TT T GAT AG 7980 

AATAGAAGTA GTAAAAAAGA AATGAGTTAG ACATGTCAAA AGGATTTTTA GTCTCTCTTG 804 0 

AGGGACCAGA GGGAGCAGGC AAGACCAGTG TTTTAGAGGC TCTGCTACCA ATTTTAGAGG 810 0 

AAAAAGGAGT AGAGGTGTTG ACGACCCGTG AACCTGGCGG AGTCTTGATT GGGGAGAAGA 8160 

TTCGGGAAGT GATTTTGGAT CCAAGTCATA CTCAGATGGA TGCTAAAACA GAG C T ACT T C 822 0 

TCTATATTGC CAGTCGCAGA CAGCATTTGG TGGAAAAAGT TCTTCCAGCC CTTGAAGCTG 82 80 

GCAAGTTGGT CATCATGGAT CGTTTTATCG ATAGTTCTGT TGCCTATCAG GGATTTGGTC 8340 

GTGGCTTAGA TATTGAAGCC ATTGACTGGC TCAATCAGTT TGC G AC AG AT GGCCTCAAAC 8400 

CCGATTTGAC ACTCTATTTT G AC AT CG AGG TGGAAGAAGG GCTGGCTCGT ATTGCTGCTA 84 60 

ATAGTGACCG CGAGGTTAAT CGTTTGGATT TGGAAGGGTT GGACTTGCAT AAAAAAGTTC 8520 

GTCAAGGCTA CCTTTCTCTT CTGGATAAAG AGGGAAATCG CATTGTCAAG ATTGATGCTA 8 5 80 

GTCTCCCTTT GGAGCAAGTT GTGGAAACTA CCAAGGCTGT CTTGTTTGAC GGAATGGGCT 8 64 0 

TGGCCAAATG AAACAAGATC AACTAAAGGC TTGGCAACCA GCTCAGTTTG ACCGTTTTGT 87 00 

CCGTATCTTA GAACAAGACC AGCTCAATCA CGCCTATCTC TTTTCAGGTT TCTTTGAAAG 87 6 0 

CTTGGAAATG GCGCAATTTT TAGCTAAGAG CCTCTTTTGT ACGGATAAAG TTGGCGTCTT 8820 

ACCATGTGAG AAATGCCGAA GTTGCAAGCT GATTGAACAG GGAGAATTTC CCGATGTCAC 8880 

C TTGATT AAA CCAGTTAATC AGGTCATTAA GACGGAACGC ATTCGAGAAT TGGTGGGTCA 8940 

GTTTTCTCAA GCAGGGATTG AAAGCCAGCA ACAGGTCTTT ATCATCGAGC AAGCGGATAA 9000 

AATGCATCCC AACGCAGCCA ATTCTCTGCT CAAGGTCATC GAAGAACCCC AGAGTGAAGT 9060 

TTATATTTTC TTCTTGACTA GCGATGAGGA AAAGATGTTA CCGACAATCC GAAGTCGGAC 9120 

TCAGATCTTC CACTTTAAAA AG C AAGAAG A AAAACTTATC TTACTCTTAG AACAAATGGG 9180 

ACTTGTTAAG AAAAAAGCGA CTCTTTTAGC TAAGTTTAGT CAATCGCGAG CTGAAGCAGA 9240 
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AAAGTTGGCT AATCAGGCAA GTTTTTGGAC CTTGGTCGAT GAAAGTGAAC GCCTGCTGAC 93 00 

TTGGTTAGTA GCTAAGAAAA AAGAAAGTTA TCTACAGGTT GCCAAATTAG CCAACTTGGC 93 60 

AGATGATAAG GAAAAACAGG ATCAGGTTTT ACGGATTCTT GAAGTTCTCT GTGGGCAGGA 942 0 

CCTCTTGCAG GTAAGAGTAA GAGTGATTCT ACAAGATTTA CTAGAAGCTA G AAAAAT GTG 9480 

GCAAGCTAAT GTCAGCTTTC AAAATGCCAT GGAATATCTG GTCTTGAAAG AAATATAAAC 9540 

TCAAAAATGA ATGATAAAGA AAGGAAAGGG CTGTTTTATG GACAAAAAAG AATTATTTGA 9 600 

CGCGCTGGAT GATTTTTCCC AACAATTATT GGTAACCTTA GCCGATGTGG AAGCCATCAA 9660 

G AAAAAT CT C AAGAGCCTGG TAGAGGAAAA TACAGCTCTT CGCTTGGAAA ATAGTAAGTT 972 0 

GCGAGAACGC TTGGGTGAGG TGGAAGCAGA TGCTCCTGTC AAGGCCAAGC ATGTTCGTGA 97 8 0 

AAGTGTCCGT CGCATTTACC GTGATGGATT TCACGTATGT AATGATTTTT ATGGACAACG 9840 

TCGAGAGCAG GACGAGGAAT GTATGTTTTG TGACGAGTTG CTATACAGGG AGTAGGCATG 9 9 00 

CAGATTCAAA AAAGTTTTAA GGGGCAGTCT CCCTATGGCA AGCTGTATCT AGTGGCAACG 9 9 60 

CCGATTGGCA ATCTAGATGA TATGACTTTT CGTGCTATCC AGACCTTGAA AGAAGTGGAC 10020 

TGGATTGCTG CTGAGGATAC GCGCAATACA GGGCTTTTGC TCAAGCATTT TGACATTTCC 10080 

ACCAAGCAGA TCAGTTTTCA TGAGCACAAT G C C AAGG AAA AAATTCCTGA TTTGATTGGT 10140 

TTCTTGAAAG CAGGGCAAAG TATTGCTCAG GTCTCTGATG CCGGTTTGCC TAGCATTTCA 10200 

GACCCTGGTC ATGATTTAGT TAAGGCAGCT ATTGAGGAAG AAATTGCAGT TGTGACAGTT 10260 

CCAGGTGCCT CTGCAGGAAT TTCTGCCTTG ATTGCCAGTG GTTTAGCGCC AC AG C C AC AT 10320 

ATCTTTTACG GTTTTTTACC G AGAAAAT C A GGTCAGCAGA AGCAATTTTT TGGCTTGAAA 103 80 

AAAGATTATC CTGAAACACA GATTTTTTAT GAATCACCTC ATCGTGTAGC AGACACGTTG 10440 

GAAAATATGT TAGAAGTCTA CGGTGACCGC TCCGTTGTCT TGGTCAGGGA ATTGACCAAA 10500 

ATCTATGAAG AATACCAACG AGGTACTATC TCTGAGTTAT TAGAAAGCAT TGCTGAAACG 105 60 

CC AC TC AAGG GCGAATGTCT TCTCATTGTT GAGGGTGCCA GTCAGGGTGT GGAGGAAAAG 10620 

GACGAGGAAG ACTTGTTCGT AGAAATTCAA ACCCGCATCC AGCAAGGTGT GAAGAAAAAC 10680 

CAAGCTATCA AGGAAGTCGC TAAGATTTAC CAGTGGAATA AAAGTCAGCT CTACGCTGCC 10740 

TACCACGACT GGGAAGAAAA ACAATAAAGG GAGACAGGAT GTAATAATTC TGTCTGTTTC 10800 

TGTTTAACTT AATTAGTGAT GATAATATAA AGATGTATCA CTTGGTATAG AAGCTTTGGT 10 860 

ATTAAGTTTT TTATTAAGCC CATACGGAAT ACCGATGGTT GGAGCAGCAG TTATAGCGTT 1092 0 

CTTAGAAGGT ATAAATAGAA AAATAAGGTC ATTTTAAATC AAAGGATTGA TAAATCAGAA 10980 
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AG AAG GTG AT TTTTTGCGAA CATACGAAAA TAAAGAAGAA CTAAAAGCTG AGATAGAGAA 11040 

AACATTTGAG AAAT AT AT T T TAGAATTTGA TAATATTCCA GAAAATTTAA AAGATAAGAG 11100 

AGCTGATGAA GTTGACAGAA CTCCAGCAGA AAACCTTGCT TATCAGGTTG GTTGGACCAA 11160 

CTTGGTTCTT AAATGGGAAG AAGATGAAAG AAAGGGGCTT CAAGTAAAAA CACCATCGGA 112 2 0 

TAAATTTAAA TGGAATCAAC TTGGTGAATT ATATCAGTGG T T C AC AG AT A CCTACGCTCA 112 80 

TTTATCTCTG CAAGAGTTGA AAGCAAAATT AAATGAAAAT ATTAATTCTA TCTCTGCAAT 11340 

GATTGATTCG TTGAGTGAGG AAGAATTATT TGAACCGCAT ATGAGAAAGT GGGCTGATGA 114 00 

AGCGACTAAA ACAGCGACTT GGGAAGTGTA TAAGTTTATT CATGTAAATA CGGTTGCACC 114 60 

TTTTGGAACT TTCAGAACTA AAAT C AG AAA ATGGAAGAAG ATAGTATTAT AAATTATATT 1152 0 

TTTAACTTTA AAAAATTTCA TAAAAATGGT TACCAAAGGC GATAGAAGAA AAACTATCGT 115 8 0 

CTTTTTCTTT GCAAATTTTT AAGAAGGGAG GTGATCTTGC ATGGACTTTG AATATTTTTA 11640 

TAACAGAGAA G CGGAAAG AT TTAACTTCTT AAAAGT AC C G GAGATATTAG TTGATAGAGA 117 00 

AGAATTTCGG GGCTTATCAG CAGAAGCAAT TATCCTTTAT TCCATACTTC TTAAACAGAC 117 60 

AGGAATGTCA TTTAAGAATA ACTGGATAGA CAAGGAAGGC AGAGTATTTA TCTATTTTAC 11820 

TGTCGAAGAA ATTATGAAAA GAAGAAATAT CTCAAAGCCA ACTGCCATAA AAACATTAGA 11880 

TGAGCTTGAT GTAAAAAAGG AATAGGACTG AT CG AAAG AG TAAGGCTTGG ACTTGGTAAG 11940 

CCGAACATCA TTTATGTTAA AGACTTTATG AGTATATTTC AGGTAAAAGA AAATGACTTA 12000 

CAGAAGTCAA AAAACTTAAC TTCAGAAGTA AAAGATTTTA AC C TC AG AAG TAAAGAAAAT 12060 

GAACTTCAAG AGGTTAAGAA CCTTGACTCT AACTATATAG AGAATAATAA GAGTAAGTAT 12120 

AGTAAGAGAG AATATAGTTT TGGTGAAAAC GGACTTGGAA CATTTCAAAA TGTGTTTTTA 12180 

GCTGCTGAAG ATATATCGGA TTT AC AAAT C ATAATGAACT CACAGCTTGA GAATTACATT 12 240 

AGACTTCCTG CAAAACTAGA ATCCTAGTTC ATGATTGATA ATGCCAGCAA TCAAATTCAT 12 300 

TCGTAATCCG AAGCGTTTAC GATGATTTCG ATAGATTGTT GAAAACATTT TAAACGTTTT 12 3 60 

TACTTTGGCA AAGATGTTCT CAATCTTGCT TCTCTCCTTG GATAGCGCAT GGTTACAGGC 12420 

TTTATCTTCA GCTGTTAGCG GCTTGAGTTT GCTGGATTTA CGTGGAGTTT GTACTTGAGG 124 80 

ATATATCTTC ATGAGCCCTT GATAACCACT GT C AG AC AAG ATTTTACCAG CTTGTCCGAT 12540 

ATTTCTGCGA C TC AT T T TG A ACAACTTCAT ATCACGACAA TAGTTCACAG CGATATCCAA 12 600 

AGAAACAATT CTCCCTTGAC TTGTGACAAT CGCTTGAGCC TTCATAGCGT GAAATTTCTT 12 660 

TTTACCAGAA TGATTCGCTA ATTCTTTTTT TAGGGCGATT GATTTTTACT TCCGTCGCAT 12720 

CAATCATTAC CGTGT C CTC A GAACTGAGAG GAGTTCTTGA AATCGTAACA CCACTTTGAA 127 8 0 
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CAAGAGTTAC TTCAACCCAT TGGCTCCGAC GGATTAAGTT GCTTTCGTGA ATACCAAAAT 12 840 

CAGCCGCAAT TTGTTCATAA GTTCGATATT CTCGCACATA TTGAAGAGTG GCCATAAGAA 12 9 00 

GGTCTTCTAG GCTTAATTTA GGTTTTCGTC CACCTTTTGC GTGTTTAAGT TGATAAGCTG 12 9 60 

TTTTTAATAC AGCTAATATC TCTTCAAAAG TCGTGCGCTG AACACCAACA AG AC GCTT AA 13020 

ATCGTGCATC AGTTAGTTGT TTACTTGCTT CATCATTCAT AGAACTACTA TACCATATTT 13 080 

TGTTTCGCAG GAAGTCTATT GGAAAGTAAG AAATATTGAA GCTGAGGCTA TTAGAAGAAA 13140 

TTGTGAGCGT GGTGCTATTT TTTCAGGTAA AATAAAATAT CACGAAGATT CACAGTTTAA 13200 

AGGAGATCAC TATGTTGAAT GTTATGCTGT TTTAGATAAT ACGGTTATAG CAAGAGATAG 13260 

AATAACAGTC CCTATCGATC CGTTATGTGG AAAAGATTTT ATAGAGTAGC AT AT AAT TG A 13320 

TTCTTAACTG G AAT ACT C AC TATCTCTTTA CAT C AAG AAA ATGACTAAAC AGGGAAGTTT 13 3 80 

GCCTTCTTCC CTTTTTTTGT TATACTAGTA GAAGAAAAAA TTAGAAAGAT TTGTGGGTGT 1344 0 

CAAACAGCCC AGTGGGGTGT TTTAATATGG ACTTAGGTCC CACCCAAAGA GGTATTAGTG 13 5 00 

TCGTGTCTCA ATCTTATATC AATGTTATCG GTGCTGGTTT GGCAGGTTCT GAAGCAGCTT 13560 

ACCAAATCGC AGAGCGTGGT ATTCCAGTTA AACTATATGA AATGCGTGGT GTCAAGTCTA 13 62 0 

CACCCCAGCA TAAAACAGAC AATTTTGCTG AGTTGGTTTG TTCCAATTCT TTGCGTGGGG 13 6 80 

ATGCTTTGAC AAATGCAGTT GGTCTTCTCA AGGAAGAAAT GCGTCGCTTG GGTTCTGTTA 13740 

TCTTGGAATC TGCTGAGGCT AC ACGTGT T C CTGCAGGTGG TGCCCTTGCA GTGGACCGTG 13 8 00 

ATGGTTTCTC TCAAATGGTG ACCGAAAAAG TTGCCAACCA CCCCTTGATT GAAGTGGTTC 13 860 

GTGATGAAAT T AC AG AAT TG CCGACAGATG TTATTACGGT TAT C G C TACT GGTCCTTTGA 1392 0 

CAAGTGATGC CTTGGCTGAA AAGATTCATG CTCTTAATGA CGGTGCTGGT TTTTATTTCT 13 9 80 

ACGATGCGGC AGCGCCTATT ATCGATGTCA ACACTATCGA TATGAGCAAG GTCTACCTCA 14 040 

AATCACGTTA TGATAAGGGA GAAGCGGCCT ACCTCAATGC CCCTATGACC AAG C AAG A AT 14100 

TTATGGATTT CCATGAAGCT TTGGTCAATG C AG AAG AAG C ACCGCTTAGT TCTTTTGAAA 1416 0 

AAGAAAAGTA CTTTGAAGGA TGTATGCCTA TCGAAGTCAT GGCCAAACGT GGCATTAAAA 1422 0 

CTATGCTTTA TGGCCCTATG AAGCCAGTCG GTCTTGAGTA CCCAGACGAC TATACAGGAC 14 2 80 

CTCGTGATGG AG AAT TT AAA ACACCTTATG CGGTTGTGCA ACTTCGTCAG GATAATGCAG 14 34 0 

CTGGTAGCCT CTACAATATT GTTGGTTTCC AGACCCACCT CAAATGGGGA GAACAAAAGC 14400 

GTGTCTTCCA AATGATTCCG GGTCTTGAAA ATGCGGAGTT TGTCCGTTAT GGTGTGATGC 144 6 0 

ATCGCAATTC TTACATGGAT TCACCAAATC TTCTTGAGCA GACTTACCGT TCTAAGAAAC 14520 
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AACCAAATCT 


CTTCTTTGCT 


GGTCAAATGA 


CGGGTGTGGA 


AGGCTATGTT 


GAGTCGGCGG 


14580 


CTTCAGGCTT 


AGTTGCGGGA 


ATTAACGCAG 


CTCGTCTCTT 


CAAGGAAGAA 


AGCGAGGCTA 


14640 


TTTTCCCCGA 


GACGACAGCG 


ATTGGAAGCT 


TAGCTCATTA 


CATTACCCAT 


GCCGACAGCA 


14700 


AACATTTCCA 


ACCAATGAAT 


GTCAATTTTG 


GGATCATCAA 


GGAGTTGGAA 


GGCGAGCGTA 


14760 


TCCGTGATAA 


GAAGGCTCGT 


TATGAAAAAA 


TTGCAGAGCG 


TGCCCTTGCC 


GACTTAGAGG 


14820 


AATTTTTGAC 


TGTCTAATTT 


TTTTGAAAGA 


ATTGCTCATG 


AT ACT AT AAA 


AATCTTAGAA 


14880 


ATTGTGATAA 


AATAGGTAGG 


ATGAAAGAAG 


G AG AGT G AAA 


ATGGCGAATC 


CCAAGTATAA 


14940 


ACGTATTTTA 


ATCAAGTTAT 


CAGGTGAAGC 


CCTTGCCGGT 


GAACGTGGCG 


TAGGGATTGA 


15000 


TATCCAAACA 


GTTCAAACAA 


TCGCAAAAGA 


GATTCAAGAA 


GTTCATAGCT 


TAGGTATCGA 


15060 


AATTGCCCTT 


GTTATCGGTG 


GAGGAAATCT 


CTGGCGTGGA 


GAACCTGCAG 


CAGAAGCAGG 


15120 


TATGGACCGT 


GTTCAGGCAG 


ATTACACAGG 


AATGCTTGGG 


ACTGTTATGA 


ATGCTCTTGT 


15180 


GATGGCAGAT 


TCATTGCAAC 


AAGTTGGGGT 


TGATACGCGT 


GTACAAACAG 


CTATTGCCAT 


15240 


GCAACAAGTG 


GCAGAGCCTT 


ATGTCCGTGG 


ACGTGCCCTT 


CGTCACCTTG 


AAAAAGGCCG 


15300 


TATCGTTATC 


TTTGGTGCTG 


GAATTGGTTC 


ACCTTACTTC 


T CG AC AG AT A 


CAACAGCGGC 


15360 


CCTTCGTGCA 


GCTGAAATCG 


AAGCAGATGC 


CATCCTCATG 


GCTAAAAATG 


GTGTCGATGG 


15420 


TGTTTACAAT 


GCCGATCCTA 


AGAAAGATAA 


G AC AG CTGT T 


AAGTTTGAAG 


AATTGACCCA 


15480 


CCGTGACGTT 


ATCAATAAAG 


GTCTTCGTAT 


CATGGACTCA 


ACAGCTTCAA 


CCCTCTCAAT 


15540 


GGACAACGAC 


ATTGACTTGG 


TTGTATTCAA 


CATGAACCAA 


CCAGGCAACA 


TCAAACGTGT 


15600 


CGTATTTGGT 


GAAAATATCG 


GAACAACAGT 


TTCAAATAAT 


AT C G AAG AAA 


AGGAATAAGA 


15660 


AAGAATATGG 


CTAACGCAAT 


TATTGAAAAA 


GCTAAAGAGA 


GAATGACCCA 


GTCTCACCAA 


15720 


TCACTTGCTC 


GTGAATTTGG 


TGGTATCCGT 


GCTGGTCGTG 


CCAATGCAAG 


CTTGCTTGAC 


15780 


CGTGTACATG 


TAGAATACTA 


TGGAGTCGAA 


ACTCCTCTTA 


ACCAAATCGC 


TTCAATTACG 


15840 


ATTCCAGAAG 


CGCGTGTTTT 


GTTGGTAACA 


CCATTTGACA 


AGTCTTCATT 


G AAAG AC AT C 


15900 


GAACGTGCCT 


TGAACGCTTC 


T G AT AT TGGT 


ATC AC AC CGG 


CTAATGACGG 


TTCTGTGATT 


15960 


CGCTTGGTTA 


TCCCAGCTCT 


TACAGAAGAA 


ACTCGTCGTG 


ACCTTGCTAA 


AGAAGTGAAG 


16020 


AAGGTCGGCG 


AAAATGCTAA 


AGTGGCTGTC 


CGCAATATCC 


GTCGCGATGC 


TATGGACGAA 


16080 


GCTAAGAAAC 


GAGAAAAAGC 


AAAAGAAATC 


ACT G AAG ACG 


AATTGAAGAC 


TCTTGAAAAA 


16140 


G AC ATT C AAA 


AAGTAACAGA 


CGATGCTGTT 


AAACACATCG 


ACG AC AT G AC 


TGCTAACAAA 


16200 


GAGAAAGAAC 


TTTTGGAAGT 


CTAAAAATAA 


ACAGAAAAAC 


TCAGTTGGCA 


TTGCTGGCTG 


16260 


AGTTTTATTC 


GAAAGAAGGA 


AATATGAATA 


CAAATCTTGC 


AAGTTTTATC 


GTTGGACTGA 


16320 
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TCATCGATGA AAACGACCGT TTTTACTTTG TGCAAAAGGA TGGTCAAACC TATGCTCTTG 16380 

CTAAGGAAGA AGGCCAACAT ACAGTAGGGG ATACGGTCAA AGGTTTTGCA TACACGGATA 16440 

TGAAGCAAAA ACTCCGCCTG ACAACCTTAG AAGTGACTGC CACTCAGGAC CAATTTGGTT 16 5 00 

GGGGACGTGT CACAGAGGTT CGTAAGGACT TGGGTGTCTT TGTGGATACA GGCCTTCCTG 165 60 

ACAAGGAAAT CGTTGTGTCA CT C GAT AT T C TCCCTGAGCT CAAGGAACTC TGGCCTAAGA 16 620 

AGGGCG AC C A ACTCTACATC CGTCTTGAAG TGGATAAGAA AGACCGTATC TGGGGCCTCT 166 80 

TGGCTTATCA AGAAGACTTC CAACGTCTTG CTCGTCCTGC CTACAACAAC ATGCAGAACC 1674 0 

AAAACTGGCC AGCCATTGTT TACCGTCTCA AGCTGTCAGG AACTTTTGTT TACCTACCAG 16800 

AAAATAATAT GCTTGGTTTT ATTCATCCTA GCGAGCGTTA CGCAGAGCCA CGTTTGGGGC 16 8 60 

AAGTATTAGA TGCGCGCGTT ATTGGTTTCC GTGAAGTGGA CCGCACTCTG AACCTCTCCC 16920 

TCAAACCACG CTCCTTTGAA ATGTTGGAAA ACGATGCTCA GAT G ATT T T G ACTTATTTGG 16980 

AAAGCAATGG CGGTTTCATG ACCTTAAATG AC AAGTC AT C TCCAGACGAC ATCAAGGCAA 17 04 0 

CCTTTGGCAT TTCTAAAGGT CAGTTCAAGA AAGCTTTAGG TGGTCTTATG AAGGCTGGTA 17100 

AAATCAAGCA GGACCAGTTT GGGACAGAGT TGATTTAGGG AGGCTTATGA GAAAATCATT 17160 

TTACACTTGG CTCATGACCG AGCGCAATCC TAAAAGTAAC AGTCCCAAAG CAATTTTGGC 17220 

AGACCTCGCT TTTGAAGAGT CAGCCTTTCC AAAACACACA GATGATTTTG ATGAGGTCAG 172 80 

TCGCTTTTTG G AGGAGC AT G CCAGTTTCTC TTTTAACCTA GGAGATTTTG ACAGCATTTG 173 4 0 

GCAGGAATAT CTAGAACACT AG C ATT T ATT CATTGGGTTT GGGCTAGTAA TTTCTCCATC 17400 

CCTCTGCTAT AATAAAAAGA AATAAAAGGA TTAGAGAGGT TCTTTATTTG AAGGAACATT 174 60 

CAATAGACAT TCAACTGAGT CATCCAGATG ACCTGTTTCA TCTTTTTGGT TCCAATGAAC 17 52 0 

GCCATCTTCG TTTGATGGAA GAAGAGCTTG ATGTTGTGAT TCATGCTCGT ACGGAGATTG 17580 

TCCAGGTTTT GGGAGAAGAG TCTGCCTGTG AGGAAGCCCG TCAAGTTATT CAGGCTTTGA 17 640 

TGGTCTTGGT AAATCGTGGG ATGACCGTTG GTACGCCAGA TGTAGTCACT GCGATTAGCA 17700 

TGGTCAAAAA TGATGAAATT GACAAGTTTG TCGCCCTTTA CGAAGAAGAA ATTATCAAGG 177 60 

ATAATACTGG GAAACCTATC CGTGTCAAAA CCCTAGGGCA AAAGCTTTAT GTGGACAGTG 17 82 0 

TCAAACAGCA TGATGTGACC TTTGGAATTG GGCCAGCAGG TACAGGGAAG ACCTTCCTTG 17 880 

CAGTGACCTT GGCAGTGACT GCCCTTAAAC GTGGGCAAGT CAAGCGAATT ATCCTAACTC 17 94 0 

GTCCAGCGGT GGAAGCGGGA GAGAGTCTTG GATTTCTTCC GGGTGATCTT AAGGAGAAGG 180 00 

TGGATCCTTA CCTTCGTCCT GTTTACGATG CCTTGTATCA AATTCTTGGG AAAG AC C AAA 180 60 
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CGACTCGTCT 


CATGGAGCGT 


GAAATTATCG 


AAATTGCGCC 


CCTTGCCTAT 


ATGCGTGGCC 


18120 


GGACCTTGGA 


TGATGCCTTT 


GTCATTCTCG 


ATGAGGCGCA 


AAACACGACC 


ATCATGCAGA 


18180 


TGAAGATGTT 


CTTGACGCGT 


TTAGGTTTTC 


ATTCTAAGAT 


GATTGTCAAT 


GGAGAT AT TA 


18240 


GTCAGATTGA 


CCTGCCACGT 


AATGTCAAGT 


CCGGTTTGAT 


TGATGCTCAA 


GAGAAACTCA 


18300 


AGAACATCCA 


TCAGATTGAC 


TTTGTTCATT 


TTTCAGCCAA 


GGATGTGGTT 


CGCCATCCTG 


18360 


TTGTCGCTCA 


GATTATCCGA 


GCCTATGAAT 


ATTCTACTGA 


AGTTGCACAC 


GACTGATTTT 


18420 


GAGGAAGTTC 


GCCTGCAAAA 


GAATAGACTT 


GTTCGGTAAC 


TGTAAAAAGT 


GTTATACTAT 


18480 


TTTTATGGAA 


ACAGTATACG 


ACAAAGCACA 


AAAACTTAAC 


TCAAAAAACT 


TCAAACTATT 


18540 


GATTGGTGTC 


AAAAAGGAAA 


CCTTTCAACT 


CATGCTAGAA 


CACCTGAATT 


CAGCCTATCA 


18600 


GATTCAGCAC 


CGAAAAGGTG 


GACGTCCACG 


TAGTCTGCCC 


ATGGAAGACC 


AG CT C ATT AT 


18660 


GACCCTCCGT 


TACTTGCGAT 


ATTATCCCAC 


TCAGCGTCTG 


CTGGCCTTTG 


ATTTTGGCGT 


18720 


CGGTGTAGCT 


ACGGTAAATG 


CCATCATCAC 


TTGGGTGGAG 


GATACACTTC 


GTGCGTCAGG 


18780 


TAGCTTTGAT 


TTGGACCATT 


TAGAAGCCCC 


GAGTGCTGCT 


GTGGCTATTG 


ACGTG AC C G A 


18840 


AAGTCCGATT 


CAGCGTCCAA 


ACAAAACCAA 


AGCAAAAATT 


ATTCTGGTAA 


AAAGAAACGA 


18900 


CACACCTTAA 


AAACTCAAAT 


TATGCTGGAT 


TTGACGACAC 


ATAAAGTCTG 


TCAAATGGCC 


18960 


TTTTCTGACG 


GACATACGCA 


TGATTTTACT 


CTCTTCAAAG 


AAAGTATTGG 


ACAAAGTTTG 


19020 


CCTGAAACGA 


CGCTTGCCTT 


TGTTGACCTA 


GGTTATTTAG 


GCATCTTGAA 


ATTTCATGAG 


19080 


AATACTTTCA 


TTCCTGCTAA 


AAATTCCAAA 


AATCGCCGCC 


TGAGTGAGGA 


TGATAAGCAG 


19140 


TTAAATAAAG 


AGATGTCAGC 


GATACGAATT 


GAAATTGAAC 


ATTTTAACGC 


TAAATTCAAG 


19200 


ACCTTCCAAA 


TCATGTCAGT 


CCCTTATCGT 


AACCGCAGAA 


AACGTTTCGA 


GTTACGGGCG 


19260 


GAATTAATTT 


GTGC CAT CAT 


CAATTATGAA 


GTG AAC TAG A 


TTCCGAACAA 


GTCTAATATA 


19320 


CTTTTGAGAG 


AGGAAAATCC 


AGTTGTATAG 


GCTAAAGGTT 


TTATCCAAAG 


GT CTG AG AC A 


19380 


ACGATTAGGC 


ACG AT G G AAA 


GAACTTTTAT 


GTGGCTGATG 


ACGATCAGTG 


CATCTTCCTG 


19440 


TGTCATAATC 


ACAGGGCACA 


AGAAAGTAGG 


AATTTGAAAA 


GATGATTGAC 


CAACTATCTA 


19500 


AGTATTACAG 


TTGTAGGATA 


CTAACTGAAA 


AGGATATTCC 


AAGTATTTTA 


TCTTTATATG 


19560 


AAAGTAATCC 


TCTGTATTTT 


CAGCATTGTC 


CACCAGAGCC 


AAATTTTGCA 


ACT GT AAAAG 


19620 


AGGACATGCT 


TTGTCTACCT 


GAAGGTAAAG 


CTAAGGCTGA 


TAAGTTTTTT 


GTTGGATTTT 


19680 


GGAATGGATC 


TGACCTTGTG 


GCTGTTATGG 


ATTTTGTCTA 


TG C AT AT C C T 


GATGAGGAGA 


19740 


CTGTTTTTAT 


TGGTTTGTTT 


ATGGTTGATC 


AAGCCTATCA 


GAGAAAAGGG 


ATTGGTAGTC 


19800 


ATATTGTGAC 


AGAAGCACTA 


GCTTATTTTG 


CTAAGAACTT 


TCGAAAGGCA 


CGTTTGGCTT 


19860 
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ATGTTAAGGG AAATCCGCAA TCTCAGCATT 
GATGCGAGGT TAAGCAAGAA CTCTATACGG 
AGAAATGGCA TCAAGTAAGA ACTATTTGGA 
TGATGTGACT TACCGTTCCA TGATGGGGGA 
TGGCGGCATT TATGACGATC GCTTTTTAGT 
TG AC C AATCT TCTTTTGAGT T T C C AT AC AA 
ACTTGATAAT AAGATGTTTC TATAAGACCT 
GCCCAAACCT AAAAAGAAAA AGCAAGGGTG 
CTGTCTTTGC ACGGGTAAAA TTTTATATAT 
TCCTTTGGTT TATATAATTG T CAT T AC AAG 
TTATTCAATT TAGAGTTTCT TATGCACAAT 
AGTATACAGT ACAATAAACC AACGATGTAA 
ACTTGCGAAG TCACCCTTTT CTTTTCAAAA 
CACGATTAAG TCCTTGAGCA ACTGGTAGGT 
CTTGACGCAA GCCTTCATCT TCAGAGATTG 
C G AT AT AAGG AAGAGTGACA TTGGTTAGGG 
GGATATTGGC AACGGCATAG TGGAGAACAC 
TTGTCACACG GTCAGCTGTT TCGATAACGC 
AGAGCCTGGA CGCATTTGTT TGACCATCTC 
AGGGATGAGA ATGGCTCCAA TCACCACATC 
TGAATTAGAC ATAAGAGTTT GAATTTGACT 
CTTGGAACTA ATATCTAAAA TAGTCACTTG 
ATGTGTACCG ACGACACCAC CACCGATGAT 
ACCACCAAGT AG AAC AC C AG AGCCACCAGC 
AACAGCCATA CGACCTGCAA CCTCACTCAT 
GTCACGAACA GTTTCAGTTG TTTTTGCTGT 
GGCCATGTGC AAGTAGGTGA AGAGAAGAAG 
TAAAGATTCT TTTACTTTCA CAACCAACTC 
AATCTCAGCT CCTTGCTTTT GATAGTCAGC 
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TTTGGGAAAA GCAGGGCTTT 
TTGTTATCGC TG AAC AG AG C 
ATTTGTTTTG GAACAATTAT 
GTATATTCTT TACTTCCGCG 
TAAACCCGTG CAAGCAGTCT 
AGGTGCCAAA GAAATGATTT 
AATTTTAGCT ATGTATAACC 
AACGAAGTAA AAAAGAAGTC 
AAAAAGAAGC TGGGACTAAA 
ACGAAGTGGT TGGGCGAAAC 
TGAGTCTGGA ACGAAAGTCT 
T AG CTG AT G A CACAAAGCAC 
TTTATACTAA ATC AT T GAT A 
TAGTCAAGTA ACCTTGATAA 
CTTGTGCGAA TCCTTTGCCA 
CGATGGTTGA AGTGCGAGCA 
CGTGTTTTTC ATAGACGGGT 
CACCTTGGTC AACAGCAACG 
ATCTGTCACC AATTCCGGTG 
AGCATCTCTC ACACTTGCTT 
TCCAAAGACT TCTTCTAGAA 
AGCACCAAGA CCAAGGGCGA 
AGTTACTTTT CCTTTTGGAA 
TTGCTTAGTA AGGAAGTGAG 
AGGAACGAGG AGCGGTAGTT 
T AAC AT AG C A TCTGCTAATT 
ATCGTCGCGC AAGTAACCGT 
TGCTGCCCAA GCTTCACCAG 
ATCAGTAAAG C C AG AAC CG A 



AAATCAATTG 1992 0 

CTAGAAGATT 19980 

CAGGATTAGA 2 0040 

GC AAG ATT AT 2 0100 

T AGATAAG AT 2 0160 

GAGTGGAAGA 2 022 0 

AACTGCCAAC 2 0280 

TGCTAAGGCC 2 0340 

GAGCTCAGCT 2 0400 

TCTGTTGACT 2 0460 

CCAGTTGCAA 2 0520 

AGTGGGTAGG 2 0580 

TCAGTGTAGT 2 0 64 0 

GTAGTCACAC 2 0700 

GCCAAAGCTT 20760 

ACCGCACCAG 2 0820 

TCATCGTGCG 2 0 8 8 0 

T C AACG AT AC 2 094 0 

CTTTTGCACC 21000 

CAATGTTGAA 210 6 0 

CTGAGAGACG 2112 0 

TGCGGGCAGC 21180 

CACCTGGTAC 212 40 

CTCCGATTTG 213 00 

GTCCTTGATT 213 60 

CTGGAGCAGC 21420 

ATTCAGAACT 21480 

CAGTAGCGAC 2154 0 

GACCAGCATT 21600 
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TGTTTCGATA AGGACACGAT GACCACGACT AACTAAGCTA TGAACACCTG CAGGTGTGAG 21660 

GGCGACACGG TTTTCGTTAT TTTTAATTTC TTTTGGGATT CCGATTAACA TTGAGATAAC 2172 0 

CTACCTTTCA ATTGACGGTC TTGTTTTGGT TGTCACATTC CAGTTCATAA ATCAAAAATG 217 8 0 

TGACGGTTTC ATTGTATATG AAACCGCTTC AAAAATCAAG AAAAACTTGT CATCCAAATT 21840 

TTTTTATGCT AG ACT AGT G A AAATCAAGCT CTAATGGAGG GAAAAGTATG GAATCAATAT 21900 

TTGTGAAATT TGCCCAGTAT CCGTCTATAG AAACGGAGCG TTTATTGCTC AGACCTGTAA 21960 

CTTTGGATGA TGCGGAAcAA TGTTTGACTA TGCCTCGGAC AAGGGTAATA CACGTTACAC 22 02 0 

TTTTCCAACC AATCAAAGCT TGGAAGAAAC CAAGAATAAC ATTGCTCAGT TCTACTTGGC 22 080 

TAATCCCTTG GGACGTTGGG GAATAGAACT AAAAAGCAAT GGTCAGTTTA TTGGAACCAT 2 214 0 

TGACTTGCAC AAGATTGATT CTGTTCTTAA GAAGGCAGCT ATTGGCTACA TTATCAATAA 22200 

AAAGTATTGG AATCAAGGAT TAACGACAGA AGCCAATCGT GCTGTGATTG AGCTAGCTTT 22 2 60 

TGAGAAGATA GGGATGAATA AGTTGACTGC CCTTCACGAT AAGGCTAATC CCGCGTCAGG 22320 

AAAGGTCATG G AG AAAT C AG GCATGCGTTT TTCCCATGCA GAACCATATG CTTGTATGGA 223 80 

CCAGCATGAA AAAGGCCGAA TCGTGACAAG AGTTCATTAT GTCTTGACCA AGGAAGACTA 22440 

TTTTGCAAAT AAATAAGCAG TTGAAAAGAA ATTTTTCGAC TGTTTTTTCT TCCTCTTACG 22 500 

AATAATCTAA GAGAGGAGAA AATATGGAAG CAATTATCGA GAAAATCAAA GAGTATAAAA 22 560 

T C AT CGT CAT CTGTACTGGT CTGGGCTTGC TTGTAGGAGG ATTTTTCCTG CTAAAACCAG 2 2 620 

CTCCACAAAC ACCTGTCAAA GAGACGAATT TGCAGGCTGA AGTTGCAGCT GTTTCCAAGG 22 680 

ACTCATCGAC CGAAAAGGAA GTGAAGAAGG AAGAAAAGGA AGAACCCCTT GAACAAGATC 22 7 40 

TAATCACAGT AG AT GTC AAA GGTGCTGTCA AATCGCCAGG GATTTATGAC TTGCCTGTAG 22 800 

GTAGT CGAGT CAATGATGCT GTTCAGAAGG CTGGTGGCTT G AC AG AG C AA GCAGACAGCA 22 860 

AGTCGCTCAA TCTAGCTCAG AAAGTTAGTG ATGAGGCTCT GGTTTACGTT CCTACTAAGG 2 2 920 

GAGAAGAAGC AGTTAGTCAA CAGACTGGTT CGGGGACAGC TTCTTCAACA AGCAAGGAAA 22 9 80 

AGAAGGTCAA TCTCAACAAG GCCAGTCTGG AAGAACTCAA GCAGGTCAAG GGACTGGGAG 2 3 040 

GAAAACGAGC TCAGGACATT ATTGACCATC GTGAGGCAAA TGGCAAGTTC AAGTCAGTAG 2 3100 

ACGAGCTCAA GAAGGTCTCT GGCATTGGTG GCAAAACAAT AGAAAAGCTT AAAG AC TAT G 2 3160 

TTACAGTGGA TTAAGAATTT CTCTATTCCC CTAATTTACC TGAGTTTTCT ATTACTTTGG 2 3220 

CTTTATTACG CTATTTTCTC AG CAT CT TAT CTTGCTTTGT TGGGCTTTGT TTTTCTGCTA 23280 

GTCTGTCTCT TTATCCAATT TCCGTGGAAA TCTGCTGGTA AAGTTCTAAT AATTTGCGGA 2 3 340 

ATCTTTGGAT TTTGGTTTGT TTTTCAAAAT TGGCAACAGA GTCAAGCGAG TCAAAATCTG 234 00 
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GCGGATTCTG TTGAAAGGGT ACGGATTTTG CCTGATACTA TTAAGGTTAA TGGTGATAGT 234 60 

CTATCCTTTC GTGGCAAGTC TAACGGTCGT GCTTTCCAAG TCTATTATAA ACTCCAGTCC 2 3 520 

GAGGAGGAGA AAGAAGCCTT TCAAGCTTTA ACTGACCTGC ATGAGATAGG ACTAGAAGGG 23580 

AAGCTTTCGG AG C C AGAAGG GCAGAGAAAT TTTGGTGGCT TTAATTACCA AGCCTATCTG 2 3 640 

AAGACTCAGG GAATTTACCA GACTCTCAAT ATCAAAACAA TCCAGTCACT TCAAAAGATT 23700 

GGCAGTTGGG ATATAGGAGA AAACTTGTCC AGTTTACGTC GAAAGGCTGT GGTTTGGATT 23760 

AAGACGCACT TTCCAGACCC TATGGGCAAT T AC AT G AC AG GACTCTTGCT GGGACATCTG 2 3 820 

G AC AC CG AC T TTGAGGAGAT GAATGAGCTT TATTCCAGTC TAGGAATTAT CCACCTCTTT 23 880 

GCCCTATCTG GCATGCAGGT AGGTTTTTTC ATGAATGGAT TTAAGAAACT TCTCTTGCGA 2394 0 

TTGGGCTTGA CCCAAGAAAA GTTGAAATGG CTGACTTATC CCTTTTCCCT TATCTATGCG 2 4000 

GGACTAACTG GATTTTCAGC ATCGGTTATT CGCAGTCTCT TGCAAAAGCT ACTGGCTCAA 2 40 60 

CATGGGGTTA AGGGCTTGGA TAATTTTGCC TTGACGGTGC TTGTCCTCTT TATTGTCATG 2412 0 

CCAAACTTTT TCTTGACAGC AGGAGGAGTC TTGTCCTGCG CTTATGCTTT TATCCTGACC 2 4180 

ATGACCAGCA AAGAAGGGGA GGGGCTCAAG GCTGTTACTA GTGAAAGTCT AGTCATCTCC 2 4240 

TTGGGCATAT TGCCCATTCT ATCCTTCTAT TTTGCGGAAT TTCAACCTTG GTCTATCCTT 24300 

TTGACCTTTG TCTTTTCCTT TCTTTTTGAC TTGGTCTTCT TACCGCTCTT GTCTATCTTA 24 3 60 

TTTGTCCTTT CCTTTCTCTA TCCAGTCATT CAGCTGAACT TTATCTTTGA ATGGTTAGAG 2442 0 

GGCATTATTC GCTTGGTCTC GCAGGTGGCA AGGAGACCAC TTGTCTTTGG TCAACCCAAC 2 4480 

GCATGGCTTT TAATCTTATT GTTAATTTCC TTGGCTTTGG TCTATGATTT GAGGAAAAAC 2 454 0 

ATTAAAGGAT TAACAGTATT GAGTTTATTG ATTACAGGTC TCTTTTTCCT TACCAAGTAT 2460 0 

CCACTGGAAA ATGAAATCAC CATGCTGGAT GTGGGGCAAG GAGAAAGTAT TTTCTACGGG 24 6 60 

ATGTAACTGG G AAAACC AT T CTCATAGATG TAGGTGGTAA GGCAGAATCT TATAAGAAAA 247 20 

TCAAAAAATG GCAAGAAAAG ATGACGACCA GCAATGCCCA GCGAACCTTG ATTCCCTATC 247 80 

TCAAAAGTCG AGGAGTAGCT AAGATTGACC AGCTAATTTT GACTAACACG GACAAGGAGC 24 840 

ATGTTGGAGA T TTGT C AG AG ATGACCAAGG CTTTCCATGT AGGGGAGATT CTAGTATCAA 249 00 

AAGACAGTCT GAAACAGAAG GAATTTGTGG CAGAACTACA GG CG ACT C AA ACAAAGGTGC 249 60 

GTAGTATGAT AGTAGGGGAG AACTTGCCCA TTTTTGGAAG TCAGTTAGAA GTTCTATCTC 2 5020 

CAAGGAAAAT GGGAGATGGA GGACACGATG ATACCCTAGT TCTGTATGGG AAATTCTTGG 2 5080 

ATAAGCAATT TCTCTTCACG GGAAATTTGG AGGAGAAAGG AGAGAAGGAC TTGCTGAAGC 2 514 0 



WO 98/18931 



PCT/US97/19588 



288 

ACTATCCAGA CTTGAAAGTA AATGTTTTGA AAGCTAGCCA ACATGGCAAT AAAAAATCAT 2 52 00 

CAAGTCCAGC CTTTCTAGAA AAACTCAAAC CAGAGCTTAC TCTTATCTCA GTTGGAAAGA 252 60 

GCAATCGAAT GAAACTCCCC CATCAGGAAA CATTGACACG ACTGGAAGGT AT C AAT AG C A 2532 0 

AAGTTTATCG AACTGACCAG CAAGGAGCTA TACGTTTTAA GGGGTTGGAT AGTTGGAAAA 2 5380 

TCGAAAGTGT TCGATAGGAA GGATAAATGT TGTAGATTAG TGAAATAAAC TAAAAATTTG 2 544 0 

TTGCATAATA ATGATAAAAA TGGTATAATG AAAACGTATT CAATATTGAG GATATAAAAT 2 5500 

CAT T AAAAAT C AG C AAAAGT TGTTTTATTA GTTAGTTTAT AATCTATTGG TCTTCTTCAG 2 5560 

TCCAGTGTAT CTGCTGTGAC AGTCACTAAA AGTTACAAGT ATGATTGGAA TACGGTTTGG 2 5 620 

GAATATAGTA CCAACTATCA CGACCATCAG TATGCTTGGA TTCCGTCATG GTCTCGTTAT 25680 

G AC AGCT AT T CTGAGTATAA AGTTGGCGGA GGCTGGAACT ACGCTCGTTA TGAGGTCATA 2 5740 

AACTATTACA GCGGAGGCTA TTAATTCTTA AAG AG T GAGA AAAAGGAGGG CTAGATATGT 2 5 800 

TGCAGCTTAC TCATGTGACC TTAAAAACGC GACAAGTCAT CTTGCAAGAT GTGGATTTCA 25860 

CCTTTAAAAA GGGTAGGGTT TATGGTCTTC TTGCTATCAA TGGCTCTGGA AAGACGACCC 2 5 920 

TGTTCCGTGC CATTAGCAAT TTAATTCCCA TAAGTAGTGG AAATATCGCA GCCCCTCCTT 2 5980 

CTTTATTTTA TTATGAGAGT ATTGAATGGC TGG AT GG AAA CTTAAGTGGG ATGGACTACC 2 6040 

TTCGTCTTAT CAAAAACATC TGGAAGTCAG GTCTGAACTT GAGGGATGAA ATCGCCTATT 2 6100 

GGGAAATGTC TG AC TAT AT C AGTCTTCCCA TTCGCAAGTA TTCCTTAGGC ATGAAGCAAC 2 6160 

GCTTGGTGAT TGCCATGTAT TTCCTCAGTC AGGCCAAATG CTGGCTCATG GATGAGATTA 2 6220 

CAAATGGCTT AGATGAGTAT TATCGACAGA AGTTTTTTGA TAGGCTAGCA CAAATCGATA 2 6280 

GACAAGAACA GCTGGTTCTT TTAAGTTCCC ACTATAAGGA AGAGTTGGTT GATGTCTGCG 2 6340 

ATAGAGTAGT AACCATTCAT CAGGGGCAGA TAGAAGAGGT TTAGTTTATG AAAGATGTTA 26400 

GTCTATTTTT ATTGAAAAAA GTTTTCAAAA GCCGCTTAAA CTGGATTGTC TTAGCTTTAT 2 6460 

TTGTATCTGT ACTCGGTGTT ACCTTTTATT TAAATAGTCA GACTGCAAAC T C AC AC AG CT 2 6520 

TGGAGAGCAG GTTGGAAAGT CGCATTGCAG C C AAC GAG AG GGCTATCAAT GAAAATGAAG 2 65 80 

AGAAACTCTC CCAAATGTCT GATACCAGCT CGGAGGAATA CCAGTTTGCT AAAAAT AATT 2 6 640 

TAGACGTGCA AAAAAATCTT TTGACGCGAA AGACAGAAAT TCTGACTTTA TTAAAAGAAG 2 6700 

GGCGCTGGAA AGAAGC CT AC TATTTGCAGT GGCAAGATGA AGAGAAGAAT TAT G AATTTG 2 6760 

TATCAAATGA CCCGACTGCT AGCCCTGGCT TAAAAATGGG GGTTGACCGC GAACGGAAGA 2 6 820 

TTTACCAAGC CCTGTATCCC TTGAACATAA AAGCACATAC TTTGGAGTTT CCGACCCACG 2 6880 

GGATTGATCA GATTGTCTGG ATTTTAGAGG TTATCATCCC AAGTTTGTTT GTGGTTGCTA 2 694 0 
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TTATTTTTAT GCTAACACAA C T ATT TG C AG AAAG AT AT C A AAATCATCTG G AC AC AG C T C 2 7 000 

ACTTATATCC TGTTTCAAAA GTGACATTTG CAATATCCTC TCTTGGAGTT GGAGTGGGAT 2 7060 

ATGTAACTGT GCTGTTTATC GGAATCTGTG GCTTTTCTTT TCTAGTGGGA AGTCTGATAA 2 712 0 

GTGGTTTTGG ACAGTTAGAT TATCCCTACC CAATTTATAG CTTAGTGAAT CAAGAAGTAA 27180 

CTATTGGGAA AATACAAGAT GT AT TAT T T C CTGGCTTGCT CTTAGCTTTC TTAGCCTTTA 2 7240 

TCGTCATTGT GGAAGTTGTG TACTTGATTG CTTACTTTTT CAAGCAAAAA ATGCCTGTCC 27300 

TCTTTCTTTC ACTCATTGGG ATTGTTGGCT TATTGTTTGG TATCCAAACC ATTCAGCCTC 273 60 

TTCAAAGGAT TGCACATCTG ATTCCCTTTA CTTACTTGCG TTCAGTGGAG ATTTTATCTG 2 742 0 

GAAGATTACC TAAGCAGATT GATAATGTCG ATCTAAATTG GAGCATGGGA ATGGTCTTAC 27480 

TTCCTTGCCT GATTATCTTT TTGCTATTGG GAATTCTATT TATTGAAAGA TGGGGAAGTT 2 7 540 

CACAGAAAAA AGAATTTTTT AATAGATTCT AGCTTTCCTA TAGGTAGGGA AAATAAGTAA 2 7 600 

AAACTAACAT AGAGAGGGAA TCAACTTGAT TCTCTCTTTT T G ATT C G AAA ACCAAACCAA 27 660 

AATACAAACA CAAACTTTTC AAAAAATAAC TTTTTATCTT GACAAGAGCT AGAAAACTTG 2 772 0 

GTATCATATA AAAGTTGAGA AAAGCAGAAG TGAGAGCTTC TCGCCTTGTG ACATTAAGTT 27 780 

GCCTGGCCCT ACGGATGAAA AGTTTCGAAG AAACGCTATC ATAACGTGCG GGCTTGTATA 2 7 840 

TTTACAAGTC CGCTATTGTT TTTCTCTAAT AAAACAAAAG AGGTGAAAAC CATAGCAAAG 2 7 900 

CAAGACTTAT TCATCAATGA TGAGATTCGT GTACGTGAAG TTCGCTTGAT TGGTCTTGAA 27 9 60 

GGAGAACAGC TAGGTATCAA GCCACTCAGT GAAGCGCAAG CTTTGGCTGA TAACGCTAAT 2 8020 

GTTGACCTAG TATTGATTCA ACCCCAAGCC AAACCGCCTG TTGCAAAAAT TATGGACTAC 2 8080 

GGTAAGTTCA AATTTGAGTA CCAGAAGAAG CAAAAAGAAC AACGTAAAAA ACAAAGCGTT 2 814 0 

GTTACTGTGA AAGAAGTTCG TCTAAGTCCG G 2 8171 

(2) INFORMATION FOR SEQ ID NO : 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7147 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
CCGCTCAACT TTTGCAATCA AGGCTAAGTA GACAGCAGCA AATTTCATAT TGTATAATTT 6 0 

CTGACTCATA CTTCTCTCTT TCTATGTGTA CTAGTATAAA TAAGAAAAAG AAGGCCGTCA 12 0 
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AGCCTTCTTT TGATTTATTC TTCTGCTTCA TCTTCTGTAA ATTGACTATT GTACAAGTCA 180 

GCGTAGAAGC CACCTTGCGC CATCAGTTCC TCATAGTTGC CTTGCTCGAT GATATTTCCA 240 
TCTTTCATGA CCAAGATCAA GTCTGCATTT CGGATGGTTG ACAAGCGGTG GGCAATGACA 3 00 

AAGGATGTGC GTCCTTCCAT CAAACGGTCC ATGGCTTTTT GGATCAATTC CTCTGTCCGT 3 60 

GTGTCAACAG AAGAAGTCGC CTCATCCAAA ATCAAAAGCG GTGCATCCTT AAGAAGGGCA 42 0 

CGAGCAATAG TCAATAGTTG TTTTTGTCTT ACAGACAAGG TCACGGTGTC ATCCAAGATG 480 

GTATCATAGC CATCTGGCAA GGTCATAATA AAGTGGTGAA TTCCCACAGC CTTACTAGCT 54 0 

TCCATCATTC GTTCATCACT AATCCCTATT TG AT TAT AG A TGAGATTGTC TCGAATAGTT 6 00 

CCTTCAAAGA GCCAGGTATC CTGCAAGACC ATTGAAAAGG CATCATGCAC TTCTGAACGC 6 60 

GTCATAGCCT TGGTATCCAC AC CAT C AATG CGAATACTTC CCTTATCAAT CTCATAGAAT 720 

TTCATCAAAA GATTGACAAT GGTTGTCTTA CCAGCCCCAG TCGGCCCAAC AATGGCAACC 780 

T TTTG AC C AG CATGAGCTGT CGCAGAGAAG TCATAGTCTT GAACATTGAC ACCGTCCACC 84 0 

AGAATTTCTC CTGCTGACAC GTCGTAGAAA CGTGGAATCA GATTGACCAG AGTTGATTTA 900 

CCAGAACCTG TTGACCCAAT AAAGGCCACT GTTTGACCAG TTTCTGCTTT AAAGCTAACA 9 60 

TGTTCAATAA CTGCCTCCGA ATTTGCCGCA TAGCGgAAGG TCACATCCTT AAACTCGACC 1020 

TGACCTTTGA AGTTTTCATC AGTCAGCTGC ACTTGAACAG GGTTTTGGAT AGAAGAATGC 10 80 

AAATCTAAAA CTTGATTAAT CCGCTTAGCA GAGACCATAG TTCGGGGAAG AACGATGAAG 114 0 

AGTGCTCCCA TGAGAAGGAA GCCCATGACA ACCTACATGG CATAAGACAT GAAAACAATC 12 00 

ATGTCACTAA AGAGAGGCAG ACGCGCTATC GGAGCAGCGT CGTTAATCAC AT AGGCC C C A 12 60 

ATCCAGTAAA TCGCCACACT CAAACCACTT GAAATCCCCA TCATGATAGG ATTCAAAATA 13 2 0 

GCCATAAGAC GGTTGACAAA CAAATTCAAA CGGGTCAATT CATCATTTAC TGCTGCAAAT 13 80 

TTTTCATTTT GATAATCCTC TGCATTGTAG GCACGAACGA CACGAATACC TGTTAAACTC 1440 

TCACGAGTGA TACTGTTCAG TTTATCTGTC AGCCCCTGAA TCAAGGACTG TTTTGGAAAG 1500 

GCTAGCGTCA TCAAAACGGT CGTCATCAGG ACGTTGATAA TCACTGCCAC AAGTACGGCC 1560 

CAGAGCCAGT ATTCTGAATG ACCTAAAATC TTCCCAATAG CCCAGATAGC CATAATTGAA 162 0 

CCACGCGTTA CCACTTGCAA GCCCATAGTA ATCAACATTT GAACTTGAGT AATGTCATTG 1680 

GTAGTACGCG TCAAGAGGCT AGGAATTGAA AATTTCTTAA TCTCTGTCTG CGAGTAATCC 1740 

AAAACTCGGT T AAAAAT AT C ACTTCTCAGC CTACTAGTAT AAGAAGCCGC CACTCGGGAT 1800 

GCAAAAAATC CAACTGCAAC TACGGACAAG AAGGCAAGAA AGGACATTCC CATCATCATG 1860 

CTTGCCGACT GCCACAACTC AT C TAAATT A GTTTCTTGAC T AC C TAG C AA ATCCGTAATT 192 0 
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TTCGAGATAT AGGTCGGCAC TTCCAACTCT AG AT AG AC CG AAAAGCAAGT AAAGAGAATG 19 80 

GCTAGTAAAA TCATCCCCCA TTCTTTTCTA CTAATTCTTT TGGCTAATTT CTTTATTCTC 2 04 0 

TCCTCCTATT CCCTTGATAT TTTGCCTGTA GTTGACCGAG AACCTTCTCA AAAATCAGTA 2100 

ATTCATCTTC ATCAATGTCT TCCATCAACT GCTTGTCTAT GCGTTCAAAA AAAGCCTTAA 2160 

CCTGTTGCAT CTGAGAACGT GCTTTGTCCG TCAGACGAAC AAACTTAGCC CGCTTATCAA 222 0 

CAGGACTCGC CTCCAATTCC ACCAAACCAT TTTGCACTAT ACGCTTAACC AGATTACTAG 22 80 

CAACAGGCTT GGTAATATTG AGTTCCTGCT CGATATCTTT AATCAAGACC AAGTCTTGGT 2 34 0 

TTTTCTCGCG AT TAT C C AAA AAACGCACAA CCTGACCTTG CGGCCCACCC ATAAATTCAA 2400 

TGCCGCAACG TTTGGCTTCC TTTTGCACCA TCAGGTGAAT TTGATGACCA AAACGCTTAA 2460 

AGACTAACAT CGGTTTATCC ATAATCTCCC CCTTCTAAAT AAAAATAGTT CTCTGGAGAA 2 52 0 

TAATTAAATT TCTATGAGAA CTATTTTCTT GATTAAAAAA ATCCCAAGTG ATTTTCTCAC 2 5 80 

TTAGGATCAT GTTCTATAGG TTAAATTAAA ACCCATCTAC GTTCGTATAA ATCTTTTGGA 2 640 

CGTCTTCGTC GTCTTCAAGA ACGCTGTAAA GTTTTTCAAA GGTTTCAAGG TCTTCGCCTG 2 700 

ACAATTCCAC TTCTGACTGA GGAATCATTT CCAATTCAGT CACTTGGAAT TCTTCAATAC 27 60 

C AG AC TC AC G GAGGGCAACG ATAGCCTTGT GAAGGTCAGT TGGCGCTGTG TAAACTGTGA 2 82 0 

TTGTACCTTC TTGTGCTTCT ACGTCATCCA CATCCACATC CGCTTCGAGC AATTGCTCAA 2 880 

AGACTGCGTC CGCATCTTCA CCTCCAAATA CAATAACACC TTTGTTGTCA AAGAGGTAAG 2940 

AAACAGAACC TGAAGCGCCC ATGTTTCCGC CGTTTTTACC AAAGGCTGCA CGGACATTGG 3000 

CTGCTGTACG GTTGACGTTA GAAGT C AAAG TATCCACAAT TAGCATAGAG CCATTTGGCC 3 0 60 

CAAAACCTTC GTAACGTCCT TCTGTAAAGG TTTCGTCTGT GTTTCCTTTG GCTTTATCAA 312 0 

TCGCTTTATC GATAATGTGT TTTGGCACTT GGGCTTGTTT AGCACGGTCG ATAACGAATT 3180 

TCAAAGCTGA GTTTGATTCT GGATCTGGAT CACCTTTTTT AG C TG C T AC A TAGATTTCTA 3 24 0 

C AC C AAATTT TGCATATACT TTAGAGTTAG CTCCATCTTT AGCCGTTTTC TTGGCTACGA 3 300 

TATTGGCCCA TTTACGTCCC ATTAGGAATC TCCTTTTTTC ACATTTTAAT CTTTCTTATT 3 3 60 

ATAACACAAG TTTTTTTGAT TTTCACTAGA GGAAATGGAT TTTATTAGCA AATCAAGCTA 3420 

GGATAGCACT TTACCTGCTA AGATGGTCTT GCCTTTCTAT CTTTATCAAC AGGCACTCAT 3480 

C C AC ATT C AA AAAACAAACT AGACCATTAT CTGCAAATAG AAAGTTTCAG CCAAGTTTGA 3 54 0 

CAAAGTCAGC TCAAATTACT GTTTGAAGTT TGTAGATATA AGCGACAAAA ACAATCATAC 3 600 

TGCACCTTTT GTTGACAGTC TACTCCAGAC AT AT C AT AGT TCAAGTAAAT ACTTTGAAAT 3 660 
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TCAACAGTTC TTATAGGCGC TATTGTATTC TAAGAAATCA ATAGAAGAGT TTCTAAGCAA 3 72 0 

ACCTCTAATA CTCAATAAAA ATCAAAGAGC AAACTAGAAA GCTAGCCTCA GGTTGCTCAA 3 7 80 

AACACTGTTT TGAGGTTGCG GATGGGGCTG ACATGGTTTG AAGAGATTTT CGAAGAGTAT 3840 

AATTTACGTG TTCCCAAGAT GGAGAAGTTA GACTAGTACA CTGGCACTTC TAAAACATTG 3 9 00 

CTAGCAATTG ATTTGTTCAT ATTTAATTTC ATTTTTTCCA TAAATGGGTA TTAGATATAA 3 9 60 

AC AG C AAAAT ATTTCCGATA CGTGTCGTTC TTGAATTTCC AATCATCTAA AACAAGTAAA 4 02 0 

GGATAATCAA TCCCCTGTAT ATCAAGGAAT TGGCTACCCT TTTTACTTTT T T AC AC ATT C 408 0 

TGTTTGATAG ATTCATTTTA ACATCACGAG CATACTCCAA TGGAAATCGC TAGGCAAGAG 414 0 

ATAAACTTTC AGATATCCGC AG AGAG AT C A TCGCCTCTTT TTGTCGCAAG CATTCTCCTC 42 0 0 

TCCTAGTCAT TTTCTACCTT ATCTTCTACC TGAGGATAGA GAGTTGTTCC CCAAATAGAA 42 60 

ATCGTCCGCT T ACGC AC T AG TGGCAAATCG GTTTTTTCAT AAACCGTACG CCACCATTCC 43 20 

CAGGCAAGCC CGGTACACTC TCTAATTTTG ACAGAGAGAT TACGAACATT CCCTTTTAAA 4380 

GG AAT AC TAG TGGTAAAGTG AGCCGTTAAA TCCTGCCCAT TTCTGTCCCA AGCCTTAGGA 4440 

GTCAAGACTT CCTTACCTTG ATGATCATAG GATAATTCAT TCCAAGTAAT ATAATATTGG 4 500 

GCAACATAGG CACCACTATG ATCCAGCAGT AAATCTCCGT TTCTGTAAGC TGTAACCTTA 4560 

GTCTCAACAT AGTCTGTACT ATTTTGAAAG GTCGCAACTA CATTGTCACG TAAAAAAGAA 4 62 0 

GTTGTATAGG AAATCGGCAA GCCTGGATGA TCTGCTGTAA AGCGACTGCC TTCTTGAATC 4 6 80 

AAGTCCTCTA CCATATCCAC CTTGCCTGTT ACAACTCGGG CACCCGAACT TGGGTCGCCC 4 7 40 

C C T AAAAT AA CCGCCTTCAC TTCTGTATTG TCCAAAATCT GTTTCCACTC TGTCTGAGGA 4800 

GCTACCTTGA CTCCTTTTAT CAAAGCTTCA AAAGCAGCCT C T AC TT C AT C ACTCTTACTC 4 8 60 

GTGGTTTCCA ACTTGAGATA GACTTGGCGC CCATAAGCAA CACTCGAAAT AT AG AC C AAA 4 92 0 

GGACGCTCTG C AG AAATTC C TCTCTGTTTT AAATCCTCTA CCGTTACAGT ATCTTGAAAC 4 980 

ACATCTCCTG GATTTTTAAC AGCATCTACG CTGACTGTAT AATAAATCTG CTTAAAATTA 5040 

ACAATCTGAA TCTGCTTTTC GCCTGAATGG ACAGAGTTAA AAT C AAT AT C AAGAGAATTC 5100 

CCTGTCTTTT CAAAGTCAGA ACCAAACTTG ACCTTGAGTT GTTCCATGCT GTGAGCCGTG 5160 

ATTTTTTCAT ACTGCATTCT AGCTGGGACA TTATTGACCT GACCATAATC TTGATGCCAC 522 0 

TTAGCCAACA AATCGTTTAC CGCTCCGCGA ACACTTGAAT TGCTGGGGTC TTCCACTTGG 5280 

AGAAAGCTAT CGCTACTTGC CAAACCAGGC AAATCAATAC TATAAGTCAT CGGAGCACGA 5340 

TCGACCGCAA GAAGAGTGGG ATTATTCTCT AACAAGGTCT CATCCACTAC G AG AAGTGC T 54 00 

CCAGGATAGA GGCGACTGTC GTTGGTAGCT GTTACAGAAA TATCACTTGT ATTTGTCGAC 5460 



WO 98/18931 



PCT/US97/19588 



293 

AAGCTCCGCT TCTTTCTTTC GATAACAACA AACTCATCGG GTAGCTGATT ACCCTCTTTG 5520 

ATGAAACGAT TTTCAATACT TTCTCCCTGA TGGGTCAAGA GTTTCTTTTT ATCGTAATTC 5580 

AT AG CT AGT A TAAAGTCATT TACTGCTTTA TTTGCCATCT TCTACCTCCT AATAAGTTCC 5640 

TGGATTGAGT TGCATAAACT CAGACTTGTT CAGCGAAATC AGCCGTGGTT GGACTAAGTA 57 00 

ATCCAAAATT TCCTCGTACA ATTCTTCTGA GACATTGCGT CGCCGTCTGG CTAAATAAGA 57 6 0 

AGTCGGAATG ACCGTATTAT CCAACATAAA TACCTTATCT AAGTCAATCA AGGTTGGTCT 582 0 

TGTAAAAGGA TTACGAGCTA GATCCGGCTC TTCTATCATA AAGTTCTTGA CCAAACGTCT 5880 

GGTCAAGAGA GCTGGTTTGA AGGTCTGATT TTTAACCAAC TCTTTGTTTT TAGTCATGCT 5 940 

GTTGTCAATA CAGATATACA TATGATTCTT C AC AG C C AAA TCGCTACTAA TAGTCGGAAA 6000 

AGGCAAATAA AGAGCTACAA CATCTCCTCT CTTAATCAAG CAAGAGCACC CCCTTTTCTC 6060 

CTAATGTAAC ATAGACAGGA TTGACCAAGT CTTCTGATTG ACTCAGAATT TCCAAAGTTT 6120 

GAGTTTGGCG CGCTGTCAAT TTAGTAGCAT CTTGTCTCTT CAATACAAAA TGCTTGTCGC 6180 

CAATAACCTT GACAATATAA TCCTTCTCCA AAGCTGACTG GTAAATCCAC ATCAGATGTT 6240 

GTCTGTCCTG AGAACTCAAG AGAGAAGGAT TTTCAAGCCT CCCGATAGTC TGATAAAAAT 63 0 0 

CAAAAACAGG AGCTAACTCC TGCCAATCTG ATTGGCTAGT TGTCAAGGCT AGAAAAAGGG 63 60 

CTTTGCGAGC TGATACTTCT TGGTTAGCCT TGAGAGTTAC TTTCCCCTCC AAGTTTTTTA 6420 

GAAATCGGGA AACTCCAGAA AGCAAATTTT TCTCTAACTG CGAGAAATAA AAACCTTTCG 64 80 

TTCCCAGACA TAAGTCTTTC ATGTCGCTTT CTCTAGCAAA TAAGAGCTCA AACATTTGAT 6540 

AGTAAAAGAA AAATATCTGG CACTGGGTCG CGCTCATCTT TTCCTTATCG GCTTCTTTTT 6 600 

T T AACC AG AG CAAGGGCGAC AGGTAGCTGG ATTGAGACAT TTCCTCTACC TCCTACTCTT 6 66 0 

TTTTAACTGG AGCATCTGCA CTAGCTGCCA CTTCTTTTGA CTGGATACTT TCCCACTGGT 672 0 

TAATCTCCTC TGAGATAAGA CCTTCGCATG TCTTGACAAA TAGGGCAAAA GCCTTGGTCT 67 80 

TTCCTGCATA TTTCTCCGTT TGGCATTGAT AGAGGAATTT TTCTTTCTCC AGGAGTTGCG 6840 

CAGTTTTTTG GTAAGAAATC CAATTTTCCT TTGCATTATA CAAATTGATA ATCCCCTCAC 6900 

ACAGCAAGCC GAGACTGGAT AAGGCAACCG AAATCAAACG GTAGCGATCA CCTGGCATAG 6 960 

GAATAGCACA AAAGACAGCT ATGAGGAAAC CTGCCACGAT TTCTGTTATT TTTAATACCT 7 020 

TATAGCGCCT ACGATGTTGA ACGCTTTTCT TTAAAAAATG AGCTATCTGT ACGTCTAATC 7080 

GCTCTGTCAG GTACATTTCT TCTGGCGTCA TATTCGTAAC TCCTTTCATT TACTTTGATA 7140 

ATCAGGG 7147 
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(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 755 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
CCGCATGGGA TTGGTGTCCT TTTGGGCAAT CTCTTTGACC AAAC TGG AAA C ATGTTT T AT 60 

GCGCCTGCCT TTACTGCCCT TGTCGGCGGT ACGTCTATAT GATCCTAGTC GCAAAAGTTC 12 0 

CGCGCTTTGG AGCCATTACC ACTATCGGCC TTGTCATTGC CCTCTTTTTC TTGGGAACTA 180 

AACACGGTGC TGGTTCCTTC CTTCCTGGAA TTATCTGTGG CCTCCTAGCA GATGGAGTAG 240 

CTCATTTAGG AAAATACAAG GACAAAACAA AGAACTTCCT TTCTTTCATT ATTTTCGCCT 3 00 

TTAGTACAAC AGGACCAATC TTGCTTATGT GGATTGCGCC CAAAGCCTAT ATGGCTACTC 3 60 

TTCTGGCAAG AGGAAAATCC CAAGAATATA TCGACCGTAT CATGGTCGCT CCAAACCCTG 42 0 

GAACTGTCCT TCTATTTATC GCAAGTATTG TCATCGGAGC CCTAGTGGGT GCCTTGATTG 480 

GACAAGCCTT GAGTAAAAAA TTTGCCCAGA AAATCTGATC AGTTAAAAAG AGCCACGCGG 54 0 

CTCTTTTTTA TTTATGGCTC AATTTCTTAG TCAAGAAATC TCCCAAGAAT TGGATTGCAA 600 

AGATAATCAA AATGATAATA ATGGTTGCCA AGATGGTCAC ATCGTGATTG TAGCGGTTAA 6 60 

ATCCATAAGC GATGGCTACG T T ACCG AT AC CACCAGCTCC AACCGCACCG GCCATAGCTG 72 0 

TTtcCCAACA AGGGaAtCAA GGTcACAGTC GTCAC 7 55 



(2) INFORMATION FOR SEQ ID NO : 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3010 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

TTCAATTGGT ATCTCAATCA ACGGTCTTCA CATGGTTTCA ACTGGTTTGA CTCTTGAAAA 60 

AGCGAAAGCT GCTGGTTACA ACGCAACTGA AACAGGCTTT AACGATCTTC AAAAACCAGA 120 

ATTCATGAAA CATGACAACC ATGAAGT AG C AATTAAGATT GTCTTTGACA AAGATAGCCG 180 

TGAAATTCTT GGTGCCCAAA TGGTTTCACA TGATATTGCA ATTAGCATGG GAATCCACAT 240 

GTTCTCACTT GCTATCCAAG AGCATGTGAC AAT TG AT AAA TTGGCATTGA CAGACCTCTT 3 00 
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CTTCTTGCCA CACTTCAACA AACCATACAA CT AC AT C AC A ATGGCTGCCC TTACGGCTGA 3 60 

AAATTAAAAA TGAATGAGCT ATCTGGCCTT AAGTTAAGGT CAGATAGTTT TTAGCTAATT 42 0 

TGTCCCCATA CAATTATAGT TTTTTTATCT TGTGCTTCAT TCTGTTCTGA CTTAAAATGA 480 

AAAGGTAGCT ACCAATACAA ATGATGAGGA TAAAACAAAT GACTGAAAAT CGTTATGAAC 540 

TAAATAAAAA CTTGGCACAG ATGCTCAAGG GTGGTGT TAT TATGGATGTG CAGAATCCTG 600 

AACAGGCTCG TATCGCAGAA GCTGCTGGTG CGGCAGCTGT GATGGCCTTG GAACGAATTC 6 60 

CGGCTGATAT TCGTGCAGCT GGAGGAGTTT CCCGCATGAG CGACCCAAAG ATGATTAAGG 720 

AAATCCAAGA AGCGGTTAGT ATTCCAGTAA TGGCTAAGGT CAGAATCGGG CATTTTGTTG 780 

AAGCTCAGAT TTTAGAGGCT ATTGAAATTG ATTATATCGA CGAGAGTGAA GTTCTATCTC 84 0 

CAGCTGATGA CCGTTTCCAT GTGGACAAGA AAGAATTCCA AGTTCCTTTT GTCTGTGGTG 900 

CTAAGGATTT GGGTGAAGCC TTGCGTCGTA TCGCTGAAGG TGCTTCCATG ATTCGTACCA 9 60 

AAGGAGAACC AGGGACAGGG GATATCGTCC AAGCTGTTCG TCATATGCGT ATGATGAATC 102 0 

AGGAAATTCG CCGCATTCAA AACTTACGTG AGGACGAGCT TTATGTTGCT GCCAAGGATT 1080 

TGCAAGTCCC TGTAGAATTG GTCCAATATG TTCATGAACA TGGAAAATTG CCAGTTGTAA 1140 

ATTTCGCTGC TGGAGGTGTT GCAACGCCAG CAGATGCTGC GTTAATGATG CAATTAGGGG 12 00 

CAGAGGGGGT CTTTGTCGGT TCAGGTATTT TCAAGTCAGG AGATCCTGTT AAACG AG CG A 12 60 

GTGCCATTGT TAAGGCTGTG ACT AAC T T CC GTAATCCTCA AATCCTAGCT CAAATCTCTG 132 0 

AAGATTTAGG AGAAGCCATG GTTGGTATTA ATGAAAATGA AATCCAAATT CTCATGGCTG 13 80 

AACGAGGAAA ATAGATGAAA ATCGGAATAT TGGCCTTGCA AGGGGCCTTT GCAGAACATG 1440 

CAAAAGTGCT AGATCAATTA GGTGTCGAGA GTGTAGAACT CAGAAATCTA GATGATTTTC 1500 

AG C AAG AT C A GAGTGACTTG TCGGGTTTGA TTTTGCCTGG TGGTGAGTCT ACAACCATGG 15 60 

GCAAGCTCTT ACGTGACCAG AACATGCTAC TTCCCATCCG AGAAGCCATT CTATCTGGCT 162 0 

TACCAGTGTT TGGGACCTGT GCGGGCTTAA TTTTGCTGGC TAAGGAAATC ACTTCTCAGA 1680 

AAGAGAGTCA TCTAGGAACT ATGGATATGG TGGTCGAGCG TAATGCTTAT GGGCGCCAAT 174 0 

TAGGAAGTTT CTACACGGAA GCAGAATGTA AGGGAGT TGG CAAGATTCCA ATGACCTTTA 1800 

TCCGTGGTCC GATTATCAGT AGTGTTGGTG AGGGTGTAGA AATTTTAGCA ACAGTGAACA 1860 

ATCAAATTGT TGCAGCCCAA GAAAAAAATA TGTTGGTAAG TTCTTTTCAT CCAGAATTGA 1920 

CTGATGATGT GCGCTTGCAC CAGTACTTTA TCAATATGTG TAAAGAAAAA AGTTGAGATT 1980 

GAATTTCTCA ACTTTTTTAC ATGTAATAAA CAATAGCGAT GTATTGAAGT GCGGACGCAG 2 040 
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CTAGGATAAA GAGATGCCAA ATCATGTGGA AATAAGGTTT TTTCTTGGCA TAAAATCCAG 2100 

CTCCAACTGT ATAACAGAGT CCGCCAGTTA CCATGAGACT C C AG AAAACG GGTGTCGTTT 2160 

GACTGATAAT GGCAGGAATG ATAGCCAGAA CCAACCAGCC CATAATCAGG TAAAGAGCAA 22 2 0 

GGCTAAATTT CTCATTGACC TTTTTAGCAA AGATTTTATA GAGAATACCA AAGATGGTCG 2280 

TTCCCCATTG GATGACAATA ATCAGATAGC CAAACCAGTT ATTCATCAAG GTCAAGACAA 2340 

CGGGCGTGTA TGAGCCGGCA ATGGCAACGT AAATCATAGA ATGGTCAATG ATTCGCAAAA 2 400 

CATATTTGTG GGTCGAACCA TAGGCCATAG AGTGATAAAT GGTGGATGAT AGGAACATGA 2460 

G AAAG AG AC T GATGACGAAA ATGGAAACGC CGATAGAGGA TAAAAATCCG TGTGCTTCAT 2 520 

AACTATAGAT GGATGAAATA GGCAGCAAGA TAAGCATGAT GACTGCACCC ACAGCATGGG 2 580 

TCACGCTATT AGCAATCTCC TCTCCAAAAC TGAGTTGTTT GCTGAGTTTA AGACTAGTGT 2 64 0 

TCATTGGATT ACCTCCTCTT GAGTATGATC GATTAAGTCT AGAGTTTGAT GATAGAGTTT 2 7 00 

AACGGTTTGG CAGCTGGTTT GGATAATAGG GTTAGCTGGG TCAATTCCTT GGTTCATGTA 2 7 60 

GTCCACAAAA GCATCGTAGA GTTGGTCTGA ACTTGCTTGA GTTTGTAGAG TATTAAGTGT 2820 

CTGGGCTATT TCTTGAATAG AAAATACAGA CTTGAGGGTT GTGATAGCAA TCAAACGGGC 2880 

AATCTGTTGG CGTTGGTATT TTTTTTTGTC AGGCTTTGTC AGGTAACCAT TTTTCACATA 2 940 

ATTGTTGACC ATAGATGCTG TTAGGCCCTT GTCTTTATTA GGAGAGATAG GGGCGCAGAC 3000 

CTGATTGACA 3 010 



(2) INFORMATION FOR SEQ ID NO : 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15213 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

CATAAATCGG TGCAAATAAC TTAATAGTGA AGTAGCCATT TCTTTCGTAT TTACCTGAGG 60 

CATATTCCCT AGACGAAAGA ATATTATTAT CAATCAAATC ATTGAATGAA CGTAGTCTTT 12 0 

CAACTTCTTC TACTGTTAGA TTTCTGACAA CATTTGTTGC ATAGACCTTA TTTCCATCAG 180 

GAT CAGGATG GTACTCATTT GTAACTTTTC TAAGAAGTTG TTGTTTTTGA TTCGTATCCA 240 

ATTTAAGAAT TGAATTTCCT T CG AG AT ATT CCAACATATA AACAACGTCA AACATGTTGT 3 00 

GGAC AT AT TG CTTCAAATCA TCTGCATTAT TAAATCTTGT AGTTGGATCA AGTACTTGTA 3 60 

ATCGTCGACT TTCTGTACTA TCAGATTTTG AATGTTTCAA GATGGAGTTG ATGGTAATGG 42 0 
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TCGCATCATC TGGATGGTCT GGTGCTTGTA ATAATCCTTT AGCAAAGAAC TCTGGTCCCA 480 

AGCCACTTCT TCGACCATAT CCTCCAAGAT AAATGTCCTG ATCTGAGTCA TGTGTCATCT 540 

CATGCGTATA AGTAATAGCT C CAT C CTT AT CCAACATTCG ATAACCCATA TAATAAACTG 600 

CATCACCTGT AGCATAAGCA CCGTGTTGAT TATGCCCAAC TTTATTTCCA ACAGGTCCAA 660 

AGAAATGTTG CATTGCAGGA TTTGGATTAT CAAAATCTGC CACTTCTGTA GCTTTCCCTA 72 0 

CGGTATTATC ATCGCCAAAT TTATAAGCAT CGTAAAGCAA AATATTTCTA TAAAGTTTTT 780 

CACGTGCATT GTCGTCTAAA AT ACG AT AC C AATAATCGTA GTGATCTCGC TGACGTTTGG 840 

CTGTTTCACG CGCATTTTCT TCAACAAAAT CATTGAGAGC CTTGCCCGCT TTATGGTCAC 900 

TACTGCGGTA GCGATCATAA GCTCCAAATC CT AG AC TAG A CATGGTCGAG ATGACAAATA 9 60 

CGGATCTCTC TGGCAAGGTC AGGAGAGGCA AGACCATATT GCGGTATTTC CATGTGGCAC 102 0 

TCGTGATACG AT C AT AAAC A CCGATAGAAT ACTTGGTGCC AGCTAACCCT TGCTTCGTTT 10 8 0 

TCACCTCTTC GATAGTGGAT TTTTCTTCGA CAATGTAAGC CTTAGTCTCT GATTTAAACC 114 0 

AGTCATTATT GCTTGTATTT GGTAAAAAGA CTTTTCGGTA ATGTTCCAGC GTGCTAAACA 12 00 

AATCTGTCGT TCCATGTTGA CTGGCAAGAC TG AT AC CAT A AGT AT C G AC A TTATTCTTAG 12 60 

CTAGAAGATT GTTAAAGCCA GATTTACCCA AC T C AATC AG AGTATCTAAT GGTGAAGCAT 132 0 

TCCCCTTACC AAAGAAGTCC AAAT GGT AC A GAACTAGGTC TTTGACATTC ACCTGACCAT 13 80 

AGCTAAAGTT ATACCACCGT TCCAGATAGG TCAAGCCAAG TAGCAAGGCT TCCTTGTTGC 144 0 

GTTTGATTTT ATCTACAAGA TAACCTTCAG TGACGGGGTT AGCACTAGCC AGTCCAGCAT 15 00 

CCGCTGACAA GAGTTTTTTC AAACTGTCTT CCAGTTGTTG TTTTGTTTTG GCGAACTGGT 1560 

CT TC T AG AT A GAGCTCAGTT TGCTTGACGT TTGGAGAAAT ACCCAGCGTC TTTCTGATGG 162 0 

CTTCTGAATG ATAGTCAACC TTTTGTAAGT CAGGTAAGAC TTGCTTGATG ATAGAGGTTT 1680 

GGTCATACAG GAATTGGTTT GG CGT AT AG A GAAGTCCAGT ATTGCCCAGA CTATATTCTG 1740 

CTAATTTGGC GAAATCATTC TGGTATTTGA GATCCAGCTT C TC AG ATAAA TCATCCTTGT 1800 

AGTGAAGCAA GAGTTTGTTT GCAGTCTGTT TGTTAGAAAC AATGTCTGTG ATGACTTGGT 1860 

TGTCCTTCAT CATGACTGCT GACAAGAGTT CTTTTTGATA TAAAAGACTG TTCTCATTGA 192 0 

CCAGGTTTCC GTATTTGACG ATGGTTGCCT TGTTGTAGAA AGGTAGCAAT TTTTCAATGT 1980 

TTTTATAAGT CAAGTTGCGC TTAGCTTGAT AATAGGCCAC CT T AG AAAAA TCACTGTCTT 2 040 

TTTTGCCACT TGTTGAAAGT GGCTCCACTG TTGGTAAAAT GAG AGG AT T G ATTTCTGCTT 2100 

TTTTGCTTGC AATTTGAGAA GCATCTAGCA TTGTTCCTCT TTCTTCAAAG GATTCCTTGC 2160 
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TGACGACCTC ATCCTTGACC AAGGTGACAT T GT AG AC T C T GTTGGCCTTG CTGCTGAATG 2 22 0 

TGTCCTTTAC CTTCATTTCG TTATAGTGGT AACCAGTGAT GGCATTTCCG TTGGTTACAT 22 80 

TAACATCGCT GAGAACATTG GTCAAACTTC CAGCATGCCT AACATCACCA GAAGTTCGAT 2 3 40 

CCCACAAATT GCCTGCCACT CCAGCGACTC T AC C AAAGTG CTTGACATTG TTGATATCAC 2400 

CTTCAGCATA GCTATCTTGG ATCTGTGCAT CTCGGTCTAC TAGGCCTGCA AGTCCACCCA 24 60 

CAGTCTGATC TGAAGTATTT GTGTTAGATG AAATGGCTAC TGTCGCTTTT GACTTAGTAA 2 52 0 

GTAAAGCCTT GTCACCTGTC AAATG AC CG A CCATACCACC G AT AT TGT AG GCAGCAGTCG 25 80 

TTTCATAAGT GTTGATAATT CTTCCCTTGA AACTGCTCTC TGTGATGCTT G AT TGCT C AG 264 0 

CCTTAGCCAG CAAACCACCG AT AC C AC GT T CACCAGCCAG AACACCATCG ACGTGAACTT 27 00 

GCTTAATTTT TGTGTTATTC TGAG CTT CAT TTGCCAGTGA ACCGATATCA TCTTTCCCTG 2760 

AAATAGCAAC ATTTTTTAGA CTCAGTTTTT CTACTGTAGC ACCACTCAAG TTTTCAAACA 2 82 0 

GAGGTTTTTT CAAATTATAG ATAGCATAAT TCTTGCCATC TTTTTCACCG ATTAAACGAC 2 880 

CAGTAAAGGT GTCCTTGATA TAGGATCTTT CAT C AGG AC C AAGCTCCACT TCGTTAGCAT 2 94 0 

TCAGGCTGGC CGCTAAATGA TAGGTTCCAG AGGGATTTTG GTTTATAGCT TTGACCAGAT 3 000 

TACTAAAGGA AGTAAAGTTT GTTGTTTCTT CTGTTCCCTT CTTAGCTAGA TAGAAGGTAA 3060 

AATTATCTTT ATATCTGCTT TCTATCTCCT GCTGAAGCTT CTCTACTTTT GCTGTGATTT 312 0 

TATAAAGGAT TTTATCATTT TTTCTTTCCT CTGATATTGA TGCTACTGGT AGGTATACAT 3180 

CTTTGAATGA AGAAGATTTC ACTTTAACAA AGTAGCTATT TGGATTGCTT GGAACTTGCT 3 24 0 

CTAACGAAAT GTGTTGTTTA T AAGT AC C AT TTGACAAACT GTATAACTCT AGGTCGGAAA 3300 

CATTTCTTAA TTCAAGTGTT TTCTCTGGTT CTTCTACCTT TTTATCAGGG TCTAGTTCAT 3 3 60 

TTTCTTGTTT AATTTCTTCG TTTCCATTTG AATTGGATGT GTTTGATTCG GTTGAAACAT 3420 

CCTCAGTTGA ATTTCCGTTT GATGGTTCTG GTTCTGTTTG TCCATTCTCT GATGTTGTAT 3480 

TACCTGAATT TTCTGGTTTT GTTGCAGTTC CGTTTTTTTC TGGTTGATTT GATTCTTCAA 3540 

CTGGTGGTTT TGAATCACTA GGTTTATTGG ATACTTCTCC AGTATTTTCG TTAGCTATTT 3 600 

TCCCAGAGTT TGTTTGTGTT TCTTCTGCAG GTTGAACTGG TTTTTCTGTT TCTTGATTTG 3 660 

AGGTACCTTC TACTGTGCCT TCATTTGGAT TTACTGGAAC TTCTTCTACA GTTTTTTCTG 372 0 

AATTTTCATT TTTAGAGTCA TTATGTTCTG GTTTATTTGA TTCTCCAACT GAGGTTGTCG 3780 

AATCACTAGG AT T ACTGG AC ACTTCCCCAG TATTTTTGCT AGATGTATCT GGTGATACTT 3 84 0 

TCTCTGAATT CGTTGTTGAT TCTTCTGCAG GTTGAACTGG ATTTTCTGCT TCTTGAATTG 3900 

AGGTTCCTTC TGTAGTACCT TCATTTGGAT TTACTGGTGT TTCTTCTGTT GGTTTTACTG 3960 
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GAACTTCTTC AGTTTTTTCT GGACCTTGTT CTTTGGTCTT CTCAACCGGA GTTTCAGGTT 4020 

TTACTTGCTC AATATTACCC TTATATTCTG GAAGCGGTGC TACCTGCTCT GGTTCACCTT 4 080 

TAT C AC TT AC CACAGTATCT GGCGACTCTG GTTGAACCTC AGTCTCACCT TTGTCGGTCA 414 0 

CAACTGCTTC GGGTAATGTA GGTTGAACTT CTGGTTCGCC TTTGTCACTT ACTACAGCTT 4200 

CGGGCAACTC AGGCTGAATT GCGGGTTCAA CAATAGCTCC AGACTGTACG TCCTTATGTT 42 60 

CTACACCAGT CTCAGGTTGT TCCTTTATAA CTTGAGTTTT TTTAGTACCT TTTTCGACTA 432 0 

TTCTTGGACT AGGCGCAGTC GTTGAAGTTG AAACAATTTC TCGCGAAACT TCTTCCTTGT 4380 

T T AC AG AG AA TATTCTGACG ATTTCAACTT TCTTACCTAA TTTACCTTCT TGTTTTACTC 4440 

TTACAGTTCC TTCAGCTAAA TCAGGATTTT CTTGAATTTC TTCTTGAAAA TCTATTTTTG 4500 

TCTCCATAGT TTCCTCACGA TATAAGAGTT CAGGTTTGTT CAATTGACCT GATAAAACTT 4560 

CATCCTGTGG ATTTAATGTA T TT AC CC C AG TCTTTTCTTT TGGAGAAATC TTCTCCTCTT 4 620 

TCTTCGTTTC TAGATTCTTA TGTTCGGCTA ATTGTTCTTG AGAATCTGAA GATTGTTTCT 4 6 80 

CTTCTTTTCT TGGATTGATT AATTCAGTAG AGAAAGGTTT TTCAACTACT TGAACTTCTG 4740 

TCGGCTTAGT TGAAGAAACA GGTGTTTGTT CCTGAATAGC TTGTACTGTT GATGGATGGT 4800 

CTACAAAATT CGGTGTAACA TTATAATCCA CCTTTTGTTG TTTTGTAGGA GTGGCAACTG 4 860 

AACTCTTTTG ATTACTTACT TCAGACTCAG AAGTCGTTTT TCCCTCTTTG AT AT AT CC AA 492 0 

TATAAGTGTA ACCTGAAATC TCTTTAGGAA GAGGTAATTT TT C T C C AG AG GTCAATTCAT 4 980 

AGTCCGTATT GTAATTTAGC AAAAGATGAT TTTCTAAAGC ATGGACTGAA ACTAAGACAC 5040 

CATTTCCTAT CCCTGCAACC AATACTAAAT GTAATACCGT TTTATTCTTA ACCTTTTTCT 5100 

TGGAAACAGC AAAAATTAAA ATTCCCATAG CAGCTAAGCT AGCACCAGCA ACTAGGGCTT 5160 

GCCTCTCATT CTTGCTTCCA GTATTTGGCA ATTCCGCCAG TTGATTTTGA GAATTTAACT 522 0 

TATAAACAAG ATAATAAGTT T CAT CATC AT TCTCCACGTA TGTCGGAATA T CAT AG AC AA 5280 

GCTGCTTCTT TTCTTCTGAT GATAGCTCTG AATCTGCCAC ATATTTATAG TGAACTCCCG 53 40 

CAGTTTCTTG AGCATCCACA GATGAACTAG C T AAT AC AG A CATAAAAAAT AAACTTGAAA 5400 

TCGTTGCAGA TACAAGTCCT ACTGATAATT TTCTAAATGA AAAACGCTCT TGTTTTTCAC 5460 

CAAAATACTT TTCCATTATT CCTCCTTGAA ATAAAATTTA TATATGTTAC AAAGACCTTT 5520 

ATTATATTAG TGTATTATCT ATTATCTATA G AAAAGG C AG TAT AC C T T AA TTATACTCTT 5580 

AATT T AC AAA AAAGTCTTAA AATTGAGATG CGCTTTCATA CTTTGTTTTA TATTATTTGG 5640 

AGGTACAATA ACACCTACCA TGAAATTTAC ACGGTAGGTG TT ACT CAT AT CACTAATCGT 57 00 
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TCTAAAAATG GTTTGAGGCA GTTGAGGAGA ATTCCTTCTA TCCAGCTTCC TTGTGCTGAT 57 60 

GAGCGATGGT CTTCCTGCAG GCTTTTTTTT AGAAAATCTC GGACTTGTTC TGGTGCGATT 5820 

TCAAATTCAA AGGCTTTCAT TTTATAGAAA AAGTCGATGA GATGATCTGA CAGGTATTCA 5 880 

GTTGAAAAGG GTACTTCACC ACTTTTTCTA TATTCTAATA AGAGTCTAGA AAATCGAGCT 5940 

TTTTCTTCAG GAAGCTCACG AAAATAGGAA TTGAGGATCC AAGTCTGCTT CTGTTTTCTT 6 000 

TCAATTGGAT CCTGACTGGC AATTCGTTGG TCTTTTTCCA GCTCTTTTTG GTATTGTTTG 6060 

GCCTTGATAG CTCGTTCTGC TCTATTTTTA CCAAAAAGAA TTTTTTCCCA CTTGCGTTCT 612 0 

TCTTGAGTCA GGGTCTCTGT AAAGCCAAAG TAATCTTGAT AAGCACGCTC TGCGGGTCCC 6180 

ATGGCTAGAA CCAGATTGTC TGCATATTGC TTGGCGATTT TATCCCTCTT CTTGCGTTCT 6240 

TTCTCTGCCT GGATACGGAG TTCTTGTTCG TAGTCAATTT TCTCCTTGCC TAGCTTGACA 63 00 

AGGTAGAGTT GGTCATCCGA TTTCCCAAGT AAAAAGGGTT TGATACACTT TTCAAGGACT 63 60 

TCTTCCATCC GAGCCTTTTT CTTTGGTTCC GCCTTGGTCC AACTTCCTCC CTGAAAGACT 6420 

TCTAGGAAAA GCTGGTAGTC TCTCTCAGGC GCAAATTGAT TGCCACGATT GGGTTTGAAA 6480 

ACACCTTTTT CCCAGAGCCA TTTTAGAAGT CGCTCGTCAA AGTTACTTTT ATTGACCTTG 6540 

ATTTTTTCCT TTTTCTGAGC TTTTCTGGTT AGATTTTCAA CCTTTCTGAG CAGTTTTTCT 6600 

TCCTCTTCCA ATTGCTGGTC AAGGGACAAT CGATGAAAAT GACGAACACA GTCGCTACCA 6660 

ATTGGAAAGA GGCGTTGGCC TGTG AC AC CG TTAAAGAGTT CATAAGCGTA TTTGATGGCA 67 2 0 

T TT C C AC AG A CACAATTGCT ACGGCCGATA CCGTTAAAAA TAAAGGAAAC TTCATTCCAT 67 80 

TCCTTGGTAG CTTGTTCCCA AGTATCCGCT TTCGAAGCCT GTAAAACTGC ATCGTGCAGG 6840 

GATTTTCTAA CTGGAAGTGT CATGAGGTCT CCTTTCTAAT ACTCAATAAA AATCAAAGAG 6900 

CAAACTAGAA AGCTAGCCGC AATCAGCTCA AAACACTGTT TTGAGGTTGT AGATAGAACT 6960 

GACGAAGTCA GCtCAAAACA CTGTTTTGAG GTTGTGGATA GAACTGACGA AGTCAgTAAC 7 02 0 

CATATATACA GCAAGGCGAA GCTGACGTGG TTTGAAGAGA TTTTCAAAGA GTATAAGTTA 7080 

TACTTTTACA ACTTGAACCT CGTCTTTACC GAGTAAAATC AAGTATTTTT CAATATTTTC 7140 

AATCGAATAG GCTCGTGATA AAGCCTCTTC GTATAGAGCT AACTGACCAC GATAGCGGTC 72 00 

TACGAGTTGA CTTGGTTCAT CATAGCGGTC TGTCTTGTAG TCGAACAGAA CAATTTTGTT 72 60 

TTCGTAAAGC AGATAGCCAT CAAGGATACC ACGGACAACA AAGTCTTCCT GACTCTTTTG 732 0 

GTCTCGTTTG AGCATGGAGA AAGGTTGCTC GCGATAAAGA TGGTCGGTAT TAGCAAGAAT 7 3 80 

TTCCTGACCG AGTACTGTGT CAAAGAAAGC AAGAATTTTA TCAAGATTGA TCTTGTCTCT 7440 

GACAGCTTGG CTAGTTTGAA CTTGTTTGAG TGTTTCTGTT AGGC TAG C AA GGGTTAGTTG 7500 
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CTGGCTGAGG TCAATTCTCT GCATGAGTTC GTGAGTAGCA CTACCAATCT CAGCTCCAGT 7 560 

TACCTTTTCT TTGGTTGAAA AATCTGGCAA ATCGAAGCTG ATTTTCTTGC CTACTGACTG 7 62 0 

ACCTTGACCA GCAATCTCGA CACCTTCCAT ATCCATAACT GGTTCGTAGA ATTTCTTGAT 7 6 80 

TTGACTTGGG GTTTGAACAC TAGGAAGTTC AATAGCTGCG CGGTGAAGAG TATTATAAAC 7740 

TTCCACCTCC TTCAGCATTT CCAGAGCTTC TTTGATGGTA TCTGACTGAC GATTGTCTGC 7 800 

TTGGGAGCTA TCTTGGAGAG GACTCTTGGT TTCCAACTCT CCGATAGCTT CTCTGGTCAA 7 8 60 

CTGATCTTCG CCAATAAAAC GATAACTAAA GTTGAGCTTG TCCTTAGTAA ACACTTTACT 7 92 0 

GAT AG C C C AA AGCCAATCTT GGAAATTCCG TGCTTGCAGT CTAGTATTGC TATTTAGTTT 7 9 80 

CCCATTTTTG GCTGCTGGGT. ATTCCTTGGA TTCCAGCTTT TCACGAGAAC CCTTGCCGAC 804 0 

AAG AT AG AG C TTTTTCTCAG CCCGCGTCAT AGCAACATAC AGCAAACGCA TCTGCTCAGA 8100 

ATAGCTTGCT AGCTGTAATT CCTCTTCGTT CTGCCTATAG GTCAGACTAG GAATGGAGAG 816 0 

TTTGATGGTT TTAGGATAGT GGTCTTCTAC TGCCCCTGTC TCCATCTTGG CAATATATTT 822 0 

G AC AC C AAG A CCATTCTGAC GACTGAGAAT GACTTCTGAC ATAGAGTCTT GCTTGTTGAA 8 2 80 

ATCTTGATCC ATATTGAGGA TAAAGACGTA AGGAAACTCC AGCCCTTTAC TCTTGTGGAT 834 0 

GGTCATGAGC TCTACTGCAT CTTTTGGCGG TGCGACGGCC ACGCTTGCCA AATCGTGCTG 8400 

GGCTTCTAAG ACTTGGTCAA TCATACGAAT AAAACGCGAC AAACCTTTGA AATTGCTCTT 84 60 

TTCAAATTGA TCAGCACGCA GTGCTAGGGC AT AG AG ATT G GCCTGCCTAG CAGGACCATT 85 2 0 

CGGCAAAGCC CCAACATAGT CAT AAT AAAA ACGGTCGTTG TAAATCTTCC AAATCAAGTC 858 0 

ATAGAGAGAG TGGGTTTTGG CATACAAGCG CCAAGAAGCT AGGATATCCA TGAATTGCTT 8 64 0 

TAGTTTTTCA GCTAGAGCTG TGTGAATCAA GCCTTTTTGA CTACTTGCCA TTTTTTGTGC 8700 

AT T G AC C AG T TTCTCATAGA GATTTTCGTG --LTT^ATCC TCTGCTTTCT GAAGGGACAA 87 60 

ACGTGCTAGC TCATCCTCAT CAAAACCAAA CATTGGAGAC TTCATAAGGG C AAC C AAG G C 8 820 

GTAGTCTTGC AGGGGATTGT GAATGACACG AAGAGTGTCT AG C AT G AC T T GCACTTCTAG 8880 

GGATTGGAGA TAATTGTTTT GCTCTCCGTC AGTTTTGACA GGAATTCCGT ACTCAGACAG 894 0 

GGCGAGGAGA ATCTGGTCAT TACGACTGCG GCTGGAGGTC AGAAGGGCAA TTTCCTTAAA 9 000 

GGCAACACCT TTTTCTTGAT GAAGTTTCAG AATCTCCTTG AT AAC T AAG C GCATTTCGCC 90 60 

TGTTAGTTTC GTTTCTGTTT GACTCTCTTC TTCCTCACCT GTATCGTCCT TGT CGT AG AG 912 0 

G AG AAAT G C T GCCTTGTTGT CTGGATTGGG AGTCAGTTTG GTATTGGCAA AAACAAGCTG 9180 

GTGCTTGTTA TCATAGTTGA TTTCGCCGAC CTCTTGGTCC ATGAGACGTT C AAAG AC AT C 92 4 0 
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ATTGGTTGCT GACAGCACTT CTG AAC T ACT ACGGAAATTT TCCTTGAGGA TAATGAGCCT 9 3 00 

GCCTTCTTGG GGATTTTGCG CATAGCGTTG GAATTTCTCA TTGAAAATCT GCGGGTCTGC 93 60 

CTGACGGAAA CGATAGATGG ATTGCTTGAT ATCTCCCACC AT AAAGC G AT TGTGGCCATT 942 0 

AGACAACAAT TCCAGCATCC GTTCTTGAAT ATGGTTGGTA TCCTGATACT CATCGACCAT 94 80 

GACTTCATGG AAGCGCTCCT GATAAGACTC ACGAACTTGT GGGAAATTCT CTAAAATCTC 9 54 0 

AATGGTGTAA TGGCTGATAT CAGCGAATTC GAAGGCATTT TCCTGTCGTT TTCTCTGACG 9 6 00 

ATAAGCCTCT AC AAAAT CGC TCATGAAAGA TTGGAAGGTT TTAGCTAGTT TCCAAGTGTC 9660 

TCCATGATAA CGTTCTTGAT AGTCGAGAAT CGCTATCTGG TCTGATAATT GTCCTAGTTT 9720 

AGCAAACTGG GTCTTTCTCT CTTCGTTGTA GGCATCAGCC AGGGGCTTCA AATCAGCCTA 97 80 

CGGCTGGCAT TAGTCAGAGC TCGACCGTTT TTCTCCTTAG AG ATGG C G AC AACACGCGCA 9 840 

AGCACTGCCT GATAAGCCTG ACTATCGGAC TCCTGATTTA GGGAGCCAAT TTCATCCAGA 9 900 

ATT AAC TG AA CATTTTCTAA ATAGGCAGCC TTTGCAAACT CCTTGGCATC GTTATCCAGA 99 60 

TGGTAACGGA AAAAGCTTTC CAAATCCCAA AGGGCTTGTT TGATTTGCTC GGTCAGTTTT 1002 0 

TCTTTTTCAC TGGTAAAATC AGCTTTCTCA AATCCTTTGA GGAAAGATTC ACTCAGCCAC 10080 

TTTTGAGGAT TACTGGTGGA TTGGAGGAAG TCATAGATTT TAT AG AC C T G CTGGCGCAGA 1014 0 

CCCCGTTCGT CCTTGCCACG CCCAGCAAAG TTTTTCAGCA AATGACTAAA GGTCTCTTTC 10200 

TGTTTACCTT GGTAATGCGC TTCAAAGACC TCATGAAAGA CTTCGTTTTC GAGAATAAGT 10260 

TGCTCGCTTT GGTTTTGTAA AATACGGAAA TTAGGTGCAA TATCAAGCAG ATAACCATGT 10320 

TTGCCAAGGA ATTTTTGTGT GAAAGAATCC ATGGTTCCAA TGGCAGCGTT GGGTAGGTCT 103 80 

GCCAACTGGC GACCCAAGTG TTGTTTGAGG TCGACATCAT CTGTTTCTTG GATTTTCTTG 10440 

CTGATTTTTT TCTCTAAACG TTCTTTAAGT TCAGTTGCAG CCTTGACGGT AAAGGTTGAG 10500 

ATAAAGAGTT G AG AAATTT C GACACCACGC GCCAATTGGT C C AG AATGCG CTCTGCCATG 10560 

ACAAAGGTCT T T C C AG AAC C AGCCGATGCT GAGACCAGGA TATTCTGGGC AGAAGTGTAG 10 620 

ATAGCTTCGA TTTGCTCGGC AGTTTTCTTC TGTTCCTTGC TCGAATTTGC TTCTGCTTCT 10680 

TGCAGTTTTT GAATCTCCTC CTCACTTAAA AAGGGAATAA GCTTC AT C G A TTCAACTCCT 10740 

CTCTTATTTT TTCAAGCCAA GCTTGCTTGA GTTTTTCTCC G AC C AG AC GC TTGCCATCAG 10800 

CTAGGTCCAA CTTTTCTAGG AAACGGGCTT GG C C C AG AT G GTAATTGGCT TCAAAGCCTG 108 60 

TAATAGCCTG ATGTTGCTGG ACGTATGGGG CAATGCTTCT GCCATTTTCA GTATAAGGAT 1092 0 

TGATGGCGAA CCGGCCTGCT AAAAT CTTCT CAGCAGCTTT CTTGTAAAGA TAGGCATTGT 10980 

AGTCCAGTAG GAGCTGAAAT TCCTCATCTG TCAGTTGATT AGCCTTGTTT TTGTTATAAA 11040 
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ATTCGCCTAA ATAACTGCTT TCTTTTTCCA AGAAGAGCCC TTGGTATTTC AT AG AT T TG C 11100 

TGGCTTCTAC CACTGCTCCT GCCAGACTTT TTACCGCCAT CAGAGATTGG ACAGGTTCAG 11160 

CCATTTCCAA GTACATGGCG CCGAAAAAGT TCTGCTCCCC TTCTCTTTTT AGGGCAGCAA 11220 

GATAGGTTGG TAACTGAGAA TTGAGCCCAT TAAAGAAATG AGGAAACTGG AACTGAGTCA 112 80 

GACTGGATTT GTAGTCTACT ACTCCTATCG CT C CAT TAG C TTTCAAACGG TCAATCCGGT 11340 

CCACCTTGCC TCGTACAAAG ACACTGCGTC CATTGTCTAA TTGAATAAAG GCTTGGTCTT 114 00 

TTCCACCAAA ATTTGCTTCT TCTTTGATGG TTTCGATGGC TGGATTGTGT CGGAGAATAT 114 6 0 

GTCCAGTTGT CCGTGCAACA TCAAGCAAAA CTTCCTTGGT AAACTGGGCT TCCAAACTTT 1152 0 

CTTGATAAAT AGCTTCAAAT TCGCGTTCTT GACTGGTTTC TTGAATAGCT TGTTCTAGAC 11580 

GTTGGTCAAA GGAATCTTCA TTAGGCAACT GTAAGGCGCG TTCAAAGATA CGATGCAAGA 11640 

AATTCCCGTG ACTACGGGCA TCAGGATGCA AACGTAATTC CTCCTGCAAG CCTAAAACGT 117 00 

AGCGTAGGAA ATAACTGTAT TCATTGCGAT AAAACTCTGT CAAACCCGAC GTAGACAGGT 117 6 0 

AAAACTCCTG TTTGGCAGGA TAGAGAGCTT GCAAGGTGTC CTTGGCTAAG GTCTTGCTGC 1182 0 

TTGGACTGGT TGGGATAGCT GGATTTTCCA GACCTTGCTG ATCTAGTTTT TTACCTATGA 118 80 

C AC G CG AC AG AACCTTGACA AAAGTCAAAT CTTGCTCAGT ATCGCTCATC TCACCCTGCT 11940 

GGTGATAGGC AACCAGACTA GACAAAAGAC TGTGATAGGA CCCCATATCC TCCTTAGACA 12000 

GTCCTTTGTG ATTCATCCTC TTCTCTCTCC GCCTAAATCC AAAATGGATC AACTCTTGAA 12 0 60 

GATAGGCAGA TTCCTTACTT TCACTTTCGT TAAAAAGGCT TGGAGCCGAC AAGAACAACT 12120 

GCTTACGAGC AGAATTGACC AAGGAAAGCA TAGTGTAGCG ATTTTTCTTG AGATTTTCAC 12180 

TGCTGGCAAT CAGTAATTGA ACGCCTTCTT CGGTCGCTTG GTTTAGGTTT TGCCTTTCTT 12240 

CAT C T GT C AG AAGACTGGTG TTTTGAGAAA TTTTTGGTAA ATTGTCCTGA GTTAGTCCAA 12300 

TAG CAT AG AC AAAGTCAGCA GTCAATGGTG CAATCAAATC GTAACTCTGC ACCAGAACAG 123 60 

TGTCCACTGT TGCTGGAATG GTACGGTATT GGGACAAACT CAT T C C AG AA TGGAGCAAGG 12420 

CTAGGAAGTC TTCCAGACTA ACCTGTGAAC CAGCAAAAAC AGTCGCAAAT TGTTCTAAAA 124 80 

CATGGCAGAA AGCCTTCCAA ACTTCGGCTT GTCTTTCCTG TTCTACAGCT TCCAAAGTGG 12 540 

TTGTCAAATC TTGTAACTGC TTGGTCACAG CTCCTTCTTT TAGAAAGACA CTCCATTTTT 12 600 

GTAGGAGTTT TTCAGCCTTT TGTTTTCGGC TGGCAAAGAG GGTTTCAAGA GGTGCTAAAA 12 660 

TTCTCAGGCG GAGGACATTC AAACGCT C AA GATTAAATTT TCCATGGTGG GATTTGGTGA 12 72 0 

AGGTTTGCTG AAAGGCTGGC AAGCCATTGA TACCAAGATA G CGG AT AT AT TGCTCAAAAG 12 78 0 
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CATCAATATC AGACTGACTG AGGTCAGTAT ACAAATCAGT TCTAAGAAGA TTAATCAAAT 12 84 0 

CCTCCTGACG AAAACGGTAA CGTTTTAAAG CTAAAATAGA CTCGACAAAC TGAGTCAAGG 12 900 

GATGATGAGC CATGGCTTCG CTTCTACCAA GATAAAAAGG AATCTGATAC TGGTCAAAAA 129 60 

TGGTTTTGAG AGATAACTGG TAAGAAGCTA CATCCCCCAA GAGAATACGA AAATGCTTGT 13 020 

AGCTCAGGTC TGAGTTCTCA TGTAATTTCT G AC G AAT ACT ACGGGCTACT AGCTCCAACT 13080 

CCTCCTTTTG CGTCAAACAA GACCAGATTT GTAAATTTTC ACGGTCTTTC T C AT C G AC AT 13140 

CCAAAGCGAG TTCTGAAAAG TCATAAGAAG ACTCCAACAA ACGAGAGGCC TTGTCAAAAC 13200 

TATCCATCTT CTCATGAGTT TGAGAACAGT CCTGAGCAGG CGTTTGGTAT TTAGAAGCCA 13 2 60 

GAT GAT GG AG AAATTTTACG CTGGCTTGGT AGAGATTGCC CTCGCTAAAA GGACTGGTAT 1332 0 

AGGCTTTCTT ACT AG C AT AA GCCCCGATAA CAATCTCAAC ACCTTTGCCG TGAAGTAAGT 13380 

CCACAACCCG CTCTTCCTCA GCAGAAAAAC GAGTAAAGCC GTCAATGACC AAGGCGATTT 13 44 0 

GATTAAAATC ACT AC T T AC C TTGTCATTCT CAATAGCCTC AATCAAATGG GACAACTGAC 13 5 00 

TTTCCTGGGC TAACTGACCT TGATTAAGAT AGGCTGTTAC TTTCTCAAAA ATCAAGAGTA 13560 

AATCCGCCCT CTTATCCTCA TCTGTTAAAT TCTCCAAGTC CAAAAAACTC AT C T GAG AT T 13 62 0 

TGGTCATCTC ATGGTAAAGC TCAATTAACT GCTGGATCAA TTGAGGATCC TGCTTAATAG 13 680 

CGCCATAAAC ACGCAAGTCC TTGGGATCGA GTTCGGCAAG GCATTTGTAA AAGGCCAACC 13 74 0 

CAAGACCGAT ATCATCAAGA GTAGTTTTAG CTGGTAAATC ATTCAAGACC AGATAGCGAG 13 800 

CCATTTGAGC AAAGCGCGTG ACGGTAATCG AAAAAGAAGC CTGCTGGGAC AAGTATTCCA 13 8 60 

GCACGGCGCG TTCCTTTTCA AAAGAAAGAG AGTTGGGGGC AATGTAGAAG ACCCGCTTGC 13 92 0 

CAGCTGCAAC TAGCTCTTCT GCCTCTCTTG TTAGAATTTC TGTCAAAGAA GTCCGAATAT 13 9 80 

CAGTATAAAG TAATTTCATC TCAGCCTCGT TGGAATTTTT CATCACCCTA TATTATACCA 14 04 0 

TGATTAGCCT CGTAAATCTG TTAAAATATT TAGGCCATCC TTTCTTTTCT TCATCATCTG 14100 

CTAAATCTTA AATACTTAGC TTTACTTGTA TTAGATAGAA TAAGTCTGGC TACTGAAAAT 14160 

CACATAATAA AAAAGCCTCG GTAACAAGGC TTTGAGTTTT ATGATTGTTT CTTAGGTACG 1422 0 

GAATACACTT CAATGTGTTG TCCCAGTATC TTAATGTCGA CTGGTAGATT GTCTGATTTA 142 80 

TCGCCATCAA CATCGGACTC T AAT TCGAT A TCAGAAGAAG TTTTAATATT ACGTGCCTTT 143 4 0 

ATATATTCAA TATTCTTGAT AGAATGATTG AACTATAGTA AATTGAAACT ATAATAGTAC 14400 

ACCGTGGATG CTAAAATATT TCTAGAAATT AATTTGATTT CCCTAATCAA GCTATTCGTA 144 60 

TCTTATTTCA AT CT ACT AT A ATAAAATGAA CCAAAAATAG TACACAATGT GGTATAATCT 1452 0 

TCTTATGGCA TATTCAATAG ATTTTCGTAA AAAAGTTCTC TCTTATTGTG AGCGAACAGG 145 8 0 
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TAGTATAACA G AAGC AT C AC ACGTTTTCCA AATCTCACGT AATACCATTT ATGGCTGGTT 14 640 

AAAGCTAAAA GAGAAAACAG G AG AGC T AAA CCACCAAGTA AAAGGAACAA AACCAAGAAA 14700 

AGTTGATAGA GAT AG AC T T A AAAACTATCT TACTGACAAT CCAGATGCTT ATTTGACTGA 147 60 

AATAGCTTCT GACTTTGGCT GTCATCCAAC TACCATCCAC TATGCGCTCA AAGCTATGGG 1482 0 

CTACACTCGA AAAAAAGAAC CACACCTACT ATGAACAAGA C C C AG AAAAA GTAGCCTTAT 14880 

TTCTTAAGAA TTTTAATAGT TTAAAGCACC TAGCACCTGT T TAG AT T G AC GAAACAGGAT 1494 0 

TCGATACTTA TTTTTATCGA GAATATGGTC GCTCATTAAA AGGTCAGTTA ATAAGAGGCA 15000 

AAGTATCTGG AAGAAGATAT CAGAGGATTT CTTTGGTTGC AGGTCTAACA AATGGTGAAT 15060 

TAATCGCTCC AATGACTTAC GAAGAGACGA TGACGAGCGA CTTTTTTGAA GCTTGGTTTC 1512 0 

AGAAGTTTCT CTTACCAACA T T AACC AC AC CAT CGGT T AT TATAGTAAAA TGAAATAAGA 1518 0 

ATAGGGGGGG GGGGGGAGGG GGGGGGAGGG AGA 15213 
(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6004 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 27: 

TTATTACCTG AAACATTAAA TTTAATTGGA CATCCCGTTA TCAATTTTAT AAT AT CAT C A 6 0 

AGATTTTTAT TATCTGATTC AGGAATTTTA TCTGATATAA CAACACCATT TTCAAGATAG 12 0 

TTCATTAAAT TATTTGATTC ACTAACATTA GTGTTTTGAT CTCCATCAAG CCAAAAATAA 180 

TGGTTATCGG AATCTAAATA CGATGAGTTT AAAATATTAT TACAAATTAT TTGATTTGCT 24 0 

CCACCAGGAA TATATCTCAC TACTAAATTC TGTTTAAGAT TCTCACTACC TG AAT G AG TG 3 00 

ATAACAAACT CTAGAATATA TTTAGCTAGT CTATCTTCAA CATAAATCAT CTTCCTAGAA 360 

TGATACACAT CACCTAATTC AAAAAATGCA T CC TG AT AAT CAATATTTTC AAT AAC AT C T 42 0 

ACCTTTTCTC CGTTTTTCAC TAAAAGTTTC ACGGCTTCTC TAGGAAAATC TTTTATAAGT 4 80 

TGTGTAGAAT GTGTAGTGAT AATAATTTGA TGTTTTTTAT TTAAACACTC TTGAAGTAAA 540 

AACTCTTTAA ATTTATAGAT TGCACTCGGA TGAAGTGAGA TTTCAGGTTC AT CT AT T AAT 6 00 

ATTAATGAAT TTGATTGCGC ATTTACTATA TCATTTACTA ACAAAATAAT TCTAGCCTCA 6 60 

CCTGTTCCTG CAAAAGCCTC GGAATATTCT TT T C C AG AT T TTTTCATCCA AATAGTTTTG 720 
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GAAGCTTTTA TAT CAT C AC C TTTTGAATAC AACTTATGTG TTAAAATTTG AATGT CTGT A 780 

TAAGATTCAT CCATTATTTC ACTAATAATT TCACAAACTT TATCATCAAC TTTAACATTA 840 

TCTATAACCA TTTCCTTTTT ATAACGCGTA TAGCTACTTG TAT TAT T C TT TAAAATATCA 900 

GCAACTGGCT TAGATCGTAA TCTTATAAAA TCTTGTTTAC TACGTTGAGT AGAAATTTTT 9 60 

TTAAAATTAT AGTGATAGAA AAATAAATCA AAAGCAGAAA CATATTCTTT ACAATCACAA 102 0 

AAGACAACAT TTTTTTCAAT GCCATCCCAT CTGTCTGTCG AAGAACTTCC AATATATTTA 10 80 

TTTTTGGGTA ATCTTTCCAT CTCATATTGT TTTTGAGGAG CATATGGTTC CCAATAATCT 1140 

AATCCTTTTT TTGTTCCAGA ACGGCCTTTA AGAACTTCTA CATTTCTAGA AGCTTTAATG 1200 

TTATAATATG AATAGATTAA ACATTGTTTC CCATCCACTT CATCTATTTG AT C AAC AT TT 12 6 0 

GTACTAAACC AATATTCAGA CACACTTTTA TTGGCTGGAG AAC CAT AT AA AGCTTGTAAA 1320 

ATTGAAGTTT TATTTACTCC ATATCTATTA CAGACACCTC AGGATTATTT AACTTATAAG 13 80 

TTTTAACAGC TACGGAATCA AT T T C AAC AG CAACTTGAAC ATCTATGCCT GATTTTTTAA 1440 

GGCCACTTGT AGTGCCACCT GCACCGTTAA ATAAATCAAT AGCAACAATT TTCCCCATAG 15 00 

TATTCTCCTA AAGTTTCTCC TTTTTATTAT AAC AT TAT C A AATGT AAAAC CCAACCCGAT 15 6 0 

AGGGTTAGGT TTTTAACATC ATTTCACCAA CTTCTTCATC TCATCAATAC GTGCGACGGT 162 0 

CGCGTCATAT TTAGCTTGGT AGTCAGCTTG TTTGTCGCAT TCTTTTTGGA CGACTTCTGG 1680 

TTTGGCGTTG GCTACGAAGC GTTCGTTAGA GAGTTTCTTA CCAACCATGT CCAGTTCTTT 1740 

TTGCCATTTA GCAAGTTCCT TGTCGAGACG GGCCAGTTCT TCTTCAACAT T G AGG AG AT C 1800 

GGCCAGTGGC AGGT AG AT TT CTGCTCCTGT GATGACACTT GACATAGCCA GTTCAGGTGC 18 60 

AGGGATGGTT GATGCGATTT CCAAGTGTTC TGGATTTGTA AAGCGTTTGA TATAGTTGAC 1920 

ATTGCTGTTA AAGAAGGCTT CCAAGTCGCT ATCGCTTGTC TTAACAAGGA TGGTGATAGG 19 80 

CTTGCTTGGT GCTACATTTA CTTCCGCACG CGCATTCCGA ACAGCACGAA TCAAGTCTTT 2 04 0 

GAGACTTTCC ACACCAGTGT GAGCCGCAAG GTCTTCAAAG GCTAGATTAA CAGTTGGGTA 2100 

TGCAGCTGTC ACGATAGAAC CTTCTGAGAT TTGTCCAAAG ATTTCCTCTG TCACGAATGG 2160 

CATGATTGGG TGAAGGAGAC GAAGGATCTT GTCCAGCGTA TAGAGGAGAA CAGATCGAGT 222 0 

AATGACCTTA TCGTCTTCAT TGTCGCTGTA TAG AAC T T C C TTGGTCAACT CAACATACCA 2280 

GTTGGCAAAT TCTTCCCAGA TGAAGTTGTA AAGGATATGA CCAGCCACAC CAAACTCGAA 2340 

CTTATCAAAG TTTTCAGTAA CTTTTGCAAT GGTTTCGTTG AG ATTGT GG A GAATCCAGCG 2400 

GTCCGTCACA TTACCAGCCT CACCTGTTGC AACTTTTGTG AC AT TGT CAT GCGCCACATC 2460 

CAGCGTCAAA CCTTCATTGT TCATGAGGAT ATAGCGAGAA ATGTTCCAAA TTTTGTTAAT 2520 
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AAAGTTCCAT GAAGCATCCA TTTTCTCGTA AG AG AAAC G A ACGTCTTGAC CTGGTGCGGA 2580 

ACCGTTTGAA AGGAACCAAC GAAGGGCATC AGCACCGTAT TTCTCGATGA CATCCATTGG 2 640 

GTCAATCCCG TT AC CG AG AG ATTTAGACAT CTTGCGTCCT TGCTCGTCAC GGATGAGACC 27 00 

GTGGATAAGC ACGTTTTGGA ATGGCTGACG ACCAGTAAAT TCCAAGGACT GG AAG AT CAT 27 60 

ACG AG AC AC C CAGAAGAAGA TGATGTCGTA ACCTGTTACC AAGGTTGAAG TTGGGAAATA 2 820 

ACGTTTAAAG TCTTCTGAGT CGACTTCAGG CCAGCCCATG GTTGAAAATG GCCAGAGGGC 2 880 

AGAACTGAAC CAAGTATCCA AGACGTCTTC GTCCTGAGTC CATCCGTCAC CTTCTGGAGC 2 940 

TTCTTCGCCG AC AT AC AT T T CACCATCAGC ATTGTACCAG GCAGGGATTT GGTGACCCCA 3000 

CCAAAGCTGA CGAGAGATAA CCCAGTCGTG GACATTTTCC ATCCATTGAA GGAAGGTATC 3060 

GTTGAAACGA GGTGGGTAGA ATTCGACCTT GTCCTCTGTG TCTTGGTTAG CAATGGCGTT 312 0 

CTTAGCCAAT TGGTCCATCT TGACGAACCA TTGAGTAGAC AAGCGTGGCT CAACTACGAC 3180 

ACCTGTACGT TCTGAGTGAC CAACACTGTG GACACGTTTT TCGATTTTGA CAAGGGCACC 3240 

GATTTCTTCC AACTTAGCAA CGACTGCCTT ACGAGCTTCA AAACGATCCA TGCCTGAAAA 3300 

TTCAAAGGCA AGCTCATTCA TAGTTCCGTC GTCGTTCATG ACGTTGACTT GTGGCAAGTT 3 3 60 

ATGACGTTGG CCAACCAAGA AGTCATTTGG ATCGTGGGCA GGTGTGATTT TCACGACACC 3 42 0 

AGTACCAAGC TCAGGATCTG CGTGCTCATC TCCAACGATT GGGATGAGTT TATTAGCGAT 34 8 0 

TGGAAGGATG AC GT TTTT AC CAATCAAGTC CTTGTAGCGC GGGTCTTCTG GATTAACCGC 354 0 

AACCGCAACG TCCCCAAACA TAGTCTCAGG ACGAGTTGTA GCAACTTCAA GGGCGCGTGA 3 600 

ACCATCTTCC AGCATGTAAT TCATGTGGTA GAAGGCACCT TCTACATCCT TGTGAATCAC 3 6 60 

CTCAATATCA GAAAGGGCTG TGCGAGCTGC TGGGTCCCAG TTGATGATAA ACTCACCACG 3 72 0 

AT AG AT CC AG CCTTTCTTGT AAAGGTTCAC AAAGAC C TT A CGAACAGCTT TTGACAAACC 3 7 80 

TT C AT C AAG A GTGAAACGCT CACGAGAATA GTCTACAGAA AGCCCCATCT TGCCCCATTG 3 84 0 

TTCCTTGATG GTAGTGGCAT ATTCGTCTTT CCATTCCCAG ACCTTCGTCA AGAAAGACTC 3 9 00 

ACGACCTAGG TCATAACGCG TAATACCCTC ACCACGTAAG CGCTCCTCAA CCTTAGCCTG 3960 

AGTCGCAATA CCAGCGTGGT CCATACCTGG AAGCCAAAGG GTATCAAAGC CTTGCATGCG 4020 

TTTTTGACGG ATGATGATAT CCTGCAAAGT CGTATCCCAA GCGTGACCAA GGTGAAGTTT 4080 

CCCAGTTACG TTTGGTGGTG GAATCACGAT TGAATAAGGC TTAGCCTTTT GATCGCCTGA 414 0 

AGGCTTGAAA ACATCCGCAT CAAGCCATTT TTGGTAACGA CCAGCCTCAA CCTCGGCTGG 4200 

ATTGT AT TT A GGTGAAAGTT CTT TAG AC AT GTGTGTGTCC TTTCTCTATT TTGTTTATTT 42 6 0 
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TATTTTGAAT TTGCTTAGCA GCTTCTTCTG CAGACAAATT CGTATTATTT ATTTTAAAGT 4 32 0 

AGTGGTGCAA CTCATTCGGT TGATGTTGGG AATTTAATTG AAGTGTTTCA GCGGTCTCTA 4 380 

AAATTTCTCT TTCAGATACC TCAATATGTC GTTTTAAGGG TTTGTGCTTT AATCGATTCT 4440 

CCGTTCGATT TCGACGTATG CACTCTTCAA GACTTGTTTC CAATTCAACA AACAGAATCT 4500 

CTTGATGAAA GTTATCCAAT AAATCCTGAA TTTGCTTTAA ATACATCAGC TGGTACTGAT 4560 

TTGAAAAATC AATTACGTCT GTTAAAATTA CTGATCGCTG ATTTCTTGCA CTTGCTCCAA 4 62 0 

GGAAAGAAAA GGTAATTCCA CGAACAAATT CCCACATCTC CTCGGTATAA TCCTGATAGA 4680 

TCTCTAGTGC AAAATCAATG GCTTGATGGT TATAAAATAG GGTAGCATCC GTCAGTCGAG 4740 

ATAATTCTTG ACCAATGGTC ATTTTTCCTG ATGCTGGAGC ACCAATGATG AAAAGATGCA 4800 

TCAAATCACC TCCCACTCAC TCCTCAGCAA GCCATATCTC AAATCATCAC AGCAGTTGCC 4 8 60 

TTGAGCATCT TTGCGGTCTC TTATGCGAGC TTCGAGGGTA AAGCCAAGCT TT T C C GAG AC 492 0 

TCGTTGACTT TGAAGGTTAT ATCCAAAGCA AGTTAGTTCA ATCTTGTGAA GACCAAGTTC 49 80 

TTTAAAAGCT AG AT C AAT C A AGGAACACGC TGCTTCTGGA AC AT AAC C T C GACCCCAATA 5040 

GTCTGGGTGC AAGGTATAGC CAAGCTCTAG CACATCATCC GCATGAAGAT GGTTGAAGTC 5100 

AACAGAACCA ATGACTTTAT CGGTTCCTTT G AC G AC AAT C C CAT AG C C AG CTGGGAGATT 5160 

TTCCTTTTGA GTACGCTCCG GAAGAATGTG C T C C AG AT AA TAAATCTCAT CTTCCAAGAT 5220 

CTTGACTGGA GGAAAACCTG CTGGATAGGC GACCTCTGGC AAACTAGCGT AGGTATGGAT 52 80 

ATCCTCAGCA TCCACCACTG TGCGGACTCG T AAAAC GAGA CGTTCTGTTT CGATTTTATC 53 4 0 

TGGCAGCTCA GTTCTTGCCA TCCTTCTTCC TCGCTTTTTT GATGAAACTG C C CTTC AT AT 5400 

CTACACGCTT GTCCAGATAG CGATAAACGC GCTGATATCC ATCTCCCATG AAATAGGTTG 54 60 

GGGCAAACAG TTGATTTTTA AAATGTCCCT TTTCATCCAG GAGTTCTGGG GCAACAAGTC 552 0 

GCTCAAGAAT CTTGGCAAAG ATGTGGCAAA TACCGTCTTC CTCAACAATC CTATCTACCC 55 80 

GACAATCTAA AACAAGTGGA CAGGCGTCTA AAATAGGAGT CTGAGTTCGT TCAGAAAT IT 5 64 0 

CATAATGCAC TCCCAAACGT TCCAATTTCT CCTGATGACT GATAAAACCA GCCTGCTCCA 57 00 

TCGCAAGCAT AGAAGTTTCA TCAGAAAT AT TCACAGTAAA TTTTTGATAC TGTTTGATCT 57 60 

GCTCTGCGGC ATTCTCTCTC GCAACGACTC CAATCACAAC CCAATCTCCT AGACTATAAG 5820 

AGGAACTACA GGTCGTGATG TTATAGCCAA AATTCTAATC TTGATATCCT AAAATAAAAA 58 80 

C AGGAAAAC C ATAATATAGT TTACTTGTGT TAAAAGATTG CTTC AT AAC A ACCCCCTTTG 5940 

ACTAAGACGT AAAAGAAAAG CCCTGCCATC TACATGACAG GGACGAATGT GTTTATCCGC 6000 

GGGG 6004 



WO 98/18931 



PCT/US97/19588 



309 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5857 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO : 28: 
TGTAGAATTC ACGACAATGC TTCGTTGATT TCTGGGTTGA TTTCGTCGCG TTCTGGCAAG 60 
CGAGTCAATG AACCAAAAAT AGTACACAAT GTGGTATAAT CCTTTTATGG CATATTCAAT 120 
AGATTTTCGT AAAAAAGTTC TCTCTTATTG TGAGCGAACA GGTAGTATAA C AG AAG CAT C 180 
ACACGTTTTC CAAATCTCAC GTAATACCAT TTATGGCTGG TTAAAGCTAA AAGAGAAAAC 24 0 

AGGAGAGCTA AACCACCAAG TAAAAGGAAC AAAACCAAGA AAAGTTGATA GAGATAGACT 3 00 

T AAAAAC TAT CTTACTGACA AT C C AG AT G C TTATTTGACT GAAATAGCTT CTGACTTTGG 3 60 

CTGTCATCCA ACTACCATCC ACTATGCGCT CAAAGCTATG GGCTACACTC GAAAAAAGAA 42 0 

CCACACCTAC TATGAACAAG ACCCAGAAAA AGTAGCCTTA TTTCTTAAGA ATTTTAATAG 480 

TTTAAAGCAC CTAACACCTG TTTAGATTGA CGAAACAGGA TTCGATACTT ATTTTTATCG 54 0 

AGAATATGGT CGCTCATTAA AAGGTCAGTT AATAAGAGGC AAAGTATCTG GAAGAAGATA 600 

TCAGAGGATT TCTTTGGTTG CAGGTCTAAC AAATGGTGAG TTAATCGCTC CAATGACTTA 660 

CGAAGAGACG ATGACGAGCG ACTTTTTTGA AGCTTGGTTT CAGAAGTTTC TCTTACCAAC 72 0 

ATTAACCACA CCATCGGTTA TTATTATGGA TAATGCAAGA TTCCATAGAA TGGGGAAGCT 780 

AGAACTCTTG TGTGAAGAGT TTGGGTATAA ACTTTTACCT CTTCCTCCCT AC T C AC CTG A 84 0 

GTACAATCCT ATTGAGAAAA CATGGGCTCA TATCAAAAAG CACCTCAAAA AGGTATTACC 900 

AAGTTGCAAT ACCTTTTATG AGGCTTTTTT GTCTTGTTCT TGTTTCAATT GACTATATAA 9 60 

ATTGTCTAAG CGAAACAACC GATAAGAATT GGCACAAAAG CGACCGTATT TTTGTTACCA 102 0 

ATACAGGAAA AACAGTTCAT AGTTCTATCT TGAGCAAGTC TCTCCAGCGA GCAAACGAAC 108 0 

GCCTTAAAAA ACCAATTCCC AAACATCTGT CCCCTCACAT CTTCAGACAC ACCACTATTA 1140 

GCATCTTATC AGAAAATAAA ATTCCTTTAA AAACAATCAC GGACAGGGTT GGTCATCCCG 1200 

ACTCTGAAGT CACTACTTCC ATCTACACCC ACGTCACAAA GAACATGAAA GATGAAGCAA 12 60 

TCAATGTACT GGATAAAGTT ATGAAAAAGA TTTTTTAAAA AGTTTTGTCC CTTTTTTGCC 1320 

CTCTAAATAC AAAAATAGCC CTTCGGATAA AATCCGAGGG GCTAGAAACG TTGTTAAATC 13 80 
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AACGGCCGAA CTTTTGAATT TCATGGTTCG GGATAAAATA GTTCACTGAA CTATTTTATT 1440 

TTTTAAGGTT ATCATAATAT CAAATAGTTC AATTAAATAC GCTAAATTAC TAATATACTT 1500 

TTTACCTTTT TCATTCTAAA ATGTAAAGTA CAAACAATTA CAATATACTA GAGGGGGAGT 1560 

AAAAAAGGTA TTAAATCGAT GAGTTCAGCA GGCAAGAAAA TAGCACCTTT ACGGGTGCTA 1620 

TTTTTTAATT AACGCCACGT TAACTTTTGA TTGATGAATT TTATTGTTTG GCACTTCTTT 1680 

CATTTCACGG TAAACATCGA TGAAATT CTT TCCAACATTA TTTTTGGAGT TAACTGCATT 1740 

TATTTTTGTA TTAATAACTT TTTTAGTATC GAAAGAATGG TTTAAGAAAT CCATAACTAA 1800 

CTCTCCTTTC TCATCCTGTA ATCAAGATTT TTATCAATGT CAAAATAGTA TTTTCTATCA 1860 

ATCCAAATTG GTCCTTCTCC TTTAGAAATA GCAAGTACAT CTACCGGACC TCCTACTGTT 192 0 

TCAAGAGTGT TGACAATTTT TCTCTTAAAT GAAGTTAATT CAATAAATGT TTTAGCTGTA 1980 

CTCGCCATTT CATTAAGTGG TTGCATTCCA ATAAGGTCTA TTATAGGATT TATATAATAT 2040 

TTTTGCTGTA TAGATGATAT ATTTTCAAAT AT ATT C T C AA TTTCATCACC CAATCCATTT 2100 

TTCTCCATAA CTGATGATAC TTGCTCTGCG ATATATACAT TTAAGTTAGG AT C T ATACC A 2160 

TTCATAATCG TCTCAACCAT CTCTGACTGT GCAAAAGGGA TTATATGACA AGTTTTATGA 222 0 

TG AT TT AT C A CACTTTCATT AATAACTTTC CAAATTAATC GTTTAGAAAA AATT C CAT AT 22 80 

AATTCAATTT GTCTTATAGA TGGAAATATC TCGTCTGTAC C AT AACCTG C TATAACTAAT 2 3 40 

CCAGTTATGT TTGTTGAGTC ATATCCAATG AAAATCGCTT TATATAAAGA TTTAGCAATA 24 00 

ACTTCAACCT CAT CAT C AGT ATGAGGAAAG GATTTAAAAA CATCGTCTAC AATGCTTTTT 24 60 

ATTAACTCTA ACTCAGCTTC AAAAAATTCA AAATTACTTT CAGCTTCTAC TTTTGAAATT 2 52 0 

TCTAAACTAA AATTAGTTAT AGCATTTAAT AAAATTTTAT T AAAAT CAT C TAGAGTGATG 2 580 

GTTTCACCAT TAGAAACTCT TAAATCAGCT GTTTCTTGCG CTTCATAGGC AATGCTGTCC 2 640 

AAAAT ACTTC TTGTACTTCT GACAATATAA TTTCTTAATA AATCCTCAAC TTGTAGATGT 27 00 

TTAAAGGAAA TTAAAAATTC TATTAGCTTT TCAACGTATT GGGCAGTATT AT CT AAT AAA 2760 

TCTGTGCCAA TAGCCTGCTT AAACTCATTT AAAATT AC CT CCCACGGAAT TTCCATAAAC 2 82 0 

GAAGCGTTCC CATATATCAT GATCCCCACG GAATGTTCTT TTGATAAAGT GAATAATTTT 2 880 

CGGGCGCTAT TAAAAACTTT TGAATTTTTC CCGTCTGATA AGGTTACAGC G CT AT C AG AA 2 94 0 

G C C AAT AC AA CACCATTTTT ATTTAATATT CCAATTTCTG CTGTCAAAAT ATCACCTAAA 3000 

CTTTCTAAAC CTGCTCATGC TCTAATGGTA CAACAGCTAA GGTCTTACCA AGACTTGCCA 3060 

AC ACT TTT AA T ACTGT AT C A AGT TGTGGGC TTGTCTTTCC TGTTTCCATT CTAGCGATAA 3120 

CTGGCTGACT AACACCGCTC ATCTCCTCTA GTTTCTTCTG ACTAATACCC TTTTCATTTC 3180 
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TAGCCTCGAT AAGCTCACTC ATGATAGCCA CGCGCATATC ACTTTCCAAA ATTTCCTCTT 3240 

TGCTGAATAA TTCAGCTCTT ACATCTTTCC AGTTACTACC AATAGCATTA TTTTTCATTG 33 00 

TCTAAACCTC TTTCTTTTAA ATCTGCAAGT TCACGTTTAG CTTGCTCAAT CTCTCTTTTG 33 60 

GGTGTTTTCT GTGTCCTTTT CATAAAATGA TGCAGTAAAA CAAAACTACC ATCCATCCAA 342 0 

GCAACAAATA AAATTCTATC TCTAAGTGGT CTCAGCTCCC AAATTTCAGC ATCTAAATGC 3480 

TTAATATATG GTTCGCCTGC GCGTGTTCCA TGTTGGCTTA ACAACTCAAT ATAATCATTA 3 54 0 

ATTTTATTAA GCTTAATTCT GCTATCTTTC CCTTTTTTAC TGGTAAGCTC TCGCATATAA 3 600 

TCAAAAACAG GCTCATTGCC GTTTTTATCC TTGTAAAAAT AGATATTATG CACTATTAAC 3 6 60 

ACCTCTTCCT AATAACAATT ATAACCTAAA AGTTATTGTT TGTAAATACT TTTAAGTTAT 372 0 

TAAAATAAAA AGCACCTAGT TTCCTAGATG CTAGCACAAT GACACGGATT CGCACCGTGG 3 7 80 

CTACCTCTAT CAAGGTGTAC TCCTTCTATA CTATCCCTTG TGCTTTAGAA TATTATACCA 3 840 

CACAATCAAC TAGATACCTA CCATCTCATG ATATACCCCC ATTTTGGGCA AGGGTACAAC 3 900 

GCTAAAATAC AAATCAGAAT AG AT ATT AAA CCACTTATTT AACTTATCAT AAGCTGGTGA 3 9 60 

TTGACTGATA AATAATATCC GCTGACAAGC TCCGATAACA TTCATGTGAT TGTACACATA 4 02 0 

AACCTCTTTT ACAGCCTCTA AAATGTCAGC CTCACTTGTT TGTACCCTAA TATCTGTTAT 4 080 

CTGCTTGATA GTTGCGTATT TTTGATAAGC TAGCATATCT TGATTTTTAG CAGCATCAAA 414 0 

CATTTTACGC T C AAGG AC AC TATACTTAGG TTGTTCTTTA TCTCGCATGA AATACCACTT 4 2 00 

GAGCCATAAA ATCTTTTCTC GGTGTATTAC AGAAATACGC TCAATTTTCT TCTTTGTCAT 4 2 60 

TGCTACCTCC TAAATCATCA ATTTAACAAT TCTAACCACT CACTTTTAGA AATAGTTGCA 4 32 0 

TAGATCTTGT TCGATGTATG ATACAAAGGT TCTAAATCTT TTTCCACCCT AATATAGTTC 4380 

ATCTTATCCT CATGAGTAGG AAAGTATAGT ATTTCCGTTT CATCCTCGTT TAGGATACGA 4440 

TTGCACCAAT CAT C AAT AAT AACTGGCACT TCCCACTCAC GCCATTTTTT AAGGTTTTCT 4 5 00 

AAAAGTTCAT TAT C ACT AAA TAGCTCGCCA TCTATTTGGA AAAATTCCCC TAAGTCATTG 456 0 

TTTCCTTCAA CAATAATAAA C TCTGGC AT A TTTCTATTAC TTAATAACTC CTTGAGTTCT 4 62 0 

TGTAACTCTT TGATTTCCTT TAGATACTTC CTCAATTTCC AACCTCAATT CTTCAATCTG 4 680 

CCTTACTACT CCAAAAATTT CATGGGTCTT AT AAG AT TGT TCAAGTATAG CCTTTGCTGC 474 0 

TTGAGTTCTT ATAAACGGGT TGACCTTACT GTCCATCATA ATATCATTGA GTACAGAAAC 4800 

AGCGTTAGAT GATGCTAAAT AAAGCATTTG AGTTGTTTTA TCCATCATCT CATCTTGCTT 48 60 

TAT C C TC AAT GTCTTTTTAA CCGCTGCAAC TTTTAGATAC TTATGACCTG TTGCGCGTGA 492 0 
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TACCCCTGCT TTTTGACATG CTTTGTCTAT CGTTGGCTCG GTAAGCATGG CATCTATGAA 4980 

TTTAATTTGC TTGGACGTAA GGTTATCATT TTCATTTCCT GCCATCTATT ACCTCCTCAT 504 0 

TATCAAAATA AAGGGTTGCC CCTTTATTTC CCTATGCTAG ATAATTCTGC AATTCTGCAT 5100 

CCATTGCCTC TGAATTGCCC TCAACAATCA TTTCATGCTG TACTAAATCA ATCTTATCTC 516 0 

CGTTAATAAG TAAACCACCG TGGAAATAAT CAATTTTTCT AT C AAGG AAA TGTACTAGCT 522 0 

TTTCAAGGCG TTGCTGTTGG CTGAATTGCT CCATGTCAAT TTCGATATAA GCAAGGGTAG 52 8 0 

TAT C ATT AT C CATAATATCT TCTAATTTTC TAAGAGCTAG AGGTTTATTT TTATATTTTT 5340 

CTAGGTATTC TCTCATTTCT GCCACTGTTA ATTTGATACT AGATAATAAA CTTAGTTCAG 54 00 

CTGCATCATC TGCTGTAATA GGCTCTTCTT TTGATTCATG GTTTGCTAGT TCAGCATTTT 54 60 

TCTCTTTTTC TAGTTGCTGA TACAATAGCT GAGCAGTATT TTGGGAATAG TTTTCGCCCT 552 0 

CTTTTTTATA TTTTAAAAGT TCTTGCTCTG CATACACTTT CCCGATAATC ACTTCCTTAT 5580 

AAACTAATTG CCCATCTTGA GCTTTTAGCT TAATACTCCC ATGCTCTGGA ATTTCAATAT 5 640 

ACTTAATTAT ACCATTTTTT GAGTATAAAA CAAAGCCTTT CTC CATC ATT TTTAATAATT 5700 

TATCATCCTT GTTTTCAGTC ATGCTTTTCT CCTTTATTTC ATTTTATTAT AATCTGAATA 57 60 

CCCCTAGTCT ATTTATTTCA CTAGGTTTTT AGGGTTCGTA TGCTAAAATA CTACCCTTTT 582 0 

TGTGTACCTT ATGGCTGACT TTTCAAATTG GTTAGTT 5857 



(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10254 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

AAAATGATAG CAGGAGAGTT TTCCCGTCCA TCAGACCCAG AACT G AG AG C CTTAGCTCAG 60 

GCTTCTCGCC AAAAACAGGC CGCCTTTAAC AAGGAAGAGA ACCCCTTGAA GGGAGCCGAA 12 0 

AT CAT C AAGA CTTGGTTTGC CTCAACCGGG AAAAATCTTT ACATCAACAC TCGCTTGATG 180 

GTGGACTACG GTGT CAAC AT CCATCTAGGG GAAAATTTTT ATTCTAATTG GAACTTGACC 240 

ATGCTGGATA TCTGTCCCAT TCGTATCGGG GACAATGCTA TGATTGGTCC TAATTGTCAG 3 00 

TTTTTGACAC CCCTCCATCC ACTAGATCCA CAGGAACGCA ATTCAGGTAT CGAGTACGGA 3 60 

AAGCCTATCA CAATCGGAGA TAATTTCTGG ACTGGTGGTG GCGTCATTGT CCTTCCTGGA 42 0 

GTGACACTGG GAAATAATGT CGTTGCAGGA GCAGGGGCAG TAATTACCAA ATCTTTTGGC 480 
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GACAACGTTG 


TCCTAGCTGG 


CAATCCTGCG 


CGCGTGATTA 


AGGAAATACC 


TGTTAAATAG 


540 


AAGTAAAAAG 


GAACAGCTGG 


GGTTGTTTCT 


TTTTTGTAGG 


TT TC AT C ATT 


TTTT AC C C AG 


600 


T T C AC AT TT A 


CCTACTCTAT 


CTCTTAGCAA 


GTCTGTTTCA 


TTAAGCAAGT 


TCAAAGCATC 


660 


TCGTAAGTGG 


GATGTTTTTC 


TCCTCAGTTC 


ATCAGCTTCC 


TCCTTGACAC 


TCGGTCAGAT 


720 


TTTGATACAA 


TAGTACAAAA 


TTAGAGGAGG 


CAGGCTATGA 


TTCAGAAACA 


TGCGATTCCT 


780 


ATTTTAGAGT 


TTGATGACAA 


TCCTCAGGCG 


GTTATCATGC 


CCAATCACGA 


GGGGCTGGAC 


840 


TTGCAGTTGC 


CAAAGAAGTG 


TGTTTATGCA 


TTTTTAGGTG 


AGGAGATTGA 


CCGCTATGCG 


900 


AGGGAAGTAG 


GGGCGAACTG 


TGTTGGCGAA 


TTTGTTTCTG 


CCACCAAGAC 


CTATCCAGTT 


960 


TATGTCGTGA 


ACTACAAGGA 


CGAGGAGGTC 


TGTCTGGCTC 


AGGCTCCTGT 


TGGCTCCGCT 


1020 


CCAGCAGCCC 


AGTTTATGGA 


TTGGTTGATT 


GGCTATGGTG 


TGGAGCAGAT 


TATCTCTACT 


1080 


GGGACCTGTG 


GTGTCCTAGC 


TGATATAGAG 


G AAAATGC C T 


TTCTAGTCCC 


TGTTCGCGCT 


1140 


CTGCGAGATG 


AAGGAGCCAG 


TT AC C AC TAT 


GTGGCACCTT 


GTCGTTATAT 


GG AAATG C AG 


1200 


CCAGAGGCTA 


TTGCTGCTAT 


TGAGGAAGTT 


TTGGAAGACA 


GAGGGATTCC 


TTATGAAGAA 


1260 


GTCATGACCT 


GGACGACAGA 


CGGTTTTTAC 


CGAGAAACGG 


CTGAAAAGGT 


GGCTTATCGT 


1320 


AAGGAAGAAG 


GCTGTGCTGT 


TGTGGAGATG 


GAGTGTTCTG 


CTCTTGCGGC 


AGTAGCTCAA 


1380 


TTGCGTGGGG 


TTCTCTGGGG 


TGAATTGTTG 


TTCACAGCAG 


ATTCTCTAGC 


GGACTTGGAC 


1440 


CAGTACGACA 


GTCGTGACTG 


GGGCTCGGAA 


GCTTTTAATA 


AGGCGCTAGA 


AC T G AGTTT A 


1500 


GCAAGTGTTC 


ACCACCTTTA 


GTTGTACTGG 


CAAAGGATTT 


GTTTTATCAT 


AAAATGTCTA 


1560 


GCTCATACTT 


TT C AAAAAT A 


TGTTTAAACG 


AGGTCACCTT 


CCTCTTGTCC 


TAGGCATGTT 


1620 


GAGGTTGGGA 


AAAATCTTTA 


AAAT C AG AAA 


AACGTATCAT 


ATCAGGTGAT 


GAAAACTTTG 


1680 


ACACTATGCG 


TTTTATGTCG 


ATAAGATTTA 


GAGTGAGATG 


AAATG AT AC T 


CTTCGAAAAT 


1740 


CTCTTCAAAC 


CAGGTCAGCT 


TCACCTTGCC 


GTAGGTATAT 


GTTACTGACT 


TCGTCAGTCT 


1800 


TATCCGGCAA 


CCTCAAAACG 


GTGTTTTGAG 


CTGACTTCGT 


CAGTTCTATT 


TGCAACCTCA 


1860 


AAACAGTGTT 


TTGAGCAACC 


TGTGACTAGC 


TTTCTAATCG 


ATGCCTTGGT 


T T T CAT TGCC 


1920 


T AT AAT C AAA 


AAGAGAAATT 


TTCTCCTGAA 


AAGCATATAG 


AGTAGCTGGC 


GTTAAAAGCT 


1980 


CCTGTCTTGC 


TTTTTTGACC 


TATAGTCACA 


TCTATCAAGT 


ATTGTTCTTG 


CCTAAGCTAT 


2040 


CAATAAAAAG 


GTGGCATTTT 


TTAGGCTTGG 


TGTTAGTAGA 


TTTTGCCTTA 


TCCTATCTAA 


2100 


GTCATTTCGA 


ACTTTTTATG 


GTACAATGGA 


AACATGTTAT 


TCAAATTATC 


TAAGGAAAAA 


2160 


ATAGAGCTAG 


GCTTATCTCG 


TTTATCGCCA 


GCCCGTCGTA 


TTTTTTTGAG 


TTTTGCCTTG 


2220 
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GTCATTTTAC TAGGCTCTCT TCTTTTGAGC TTGCCCTTTG TCCAAGTTGA AAGCTCACGA 2 2 80 

GCGACTTATT TTGATCATCT TTTCACTGCT GTCTCTGCAG TCTGTGTGAC GGGTCTCTCA 234 0 

ACCCTTCCAG TAGCTCACAC CTATAATATC TGGGGTCAAA TAATCTGTTT GCTCTTGATT 2 400 

CAGATCGGTG GTCTAGGGCT CATGACCTTT ATTGGGGTTT TCTATATCCA GAGCAAGCAA 2 4 60 

AAGCTTAGTC TTCGTAGCCG TGCAACTATT C AGG AT AG T T TTAGTTATGG AGAAACTCGA 2 52 0 

TCTTTGAGAA AGTTTGTCTA TTCTATTTTT CTCACGACCT TTTTGGTTGA GAGCTTGGGA 2 580 

GCTATTTTGC TTAGTTTTCG CCTTATTCCT CAACTTGGCT GGGGACGTGG TCTTTTTAGT 2 640 

TCCATTTTTC TAGCGATCTC AGCCTTCTGT AATGCCGGTT TTGATAATTT AGGGAGCACC 27 00 

AGTTTATTTG CTTTTCAGAC CGATTTACTG GTCAATCTGG TGATTGCAGG CTTGATTATT 2 7 60 

ACAGGCGGCC TTGGTTTTAT GGTCTGGTTT GATTTGGCTG GTCATGTAGG AAGAAAGAAA 2 82 0 

AAAGGACGTC TGCACTTTCA TACGAAGCTT GTACTATTAT TGACTATAGG TTTGTTGTTA 2 880 

TTTGGAACAG C AACT ACT C T CTTTCTTGAG TGGAACAATG CTGGAACGAT TGGCAATCTC 2 940 

CCTGTTGCCG ATAAGGTTTT AGTTAGCTTT TTTCAAACAG TGACGATGCG AACAGCTGGC 3 000 

TTTTCTACGA TAGATTATAC TCAGGCTCAT CCTGTGACTC TTTTGATTTA TATCTTACAG 3 060 

ATGTTTCTAG GTGGGGCACC TGGAGGAACA GCTGGGGGAC TCAAGATTAC GACATTTTTT 312 0 

GTCCTCTTGG TCTTTGCACG AAGTGAGCTT CTAGGCTTGC CTCATGCCAA TGTTGCGAGA 3180 

CGAACGATCG CGCCGCGAAC GGTTCAAAAA TCCTTTAGTG T CT TT ATT AT CTTTTTGATG 3240 

AGCTTCTTGA TAGGATTGAT TCTGCTAGGG ATAACAGCCA AAGGCAATCC TCCCTTTATC 3 3 00 

CACCTCGTAT TTGAAACCAT TTCAGCTCTT AGTACAGTTG GTGTAACGGC AAATCTGACT 33 60 

CCTGACCTTG GGAAATTGGC TCTCAGTGTT AT CAT G C C AC TTATGTTTAT GGGACGAATT 342 0 

GGTCCCTTGA CCTTGTTTGT TAGCTTGGCA GAT T AC CATC CAGAAAAGAA AGATATGATT 3480 

CACTATATGA AAGCAGATAT TAGTATTGGT TAAGAAAGGA AAGAGCATGT C AG AT CGT AC 3 540 

GATTGGAATT TTGGGCTTGG GAATTTTTGG GAGCAGTGTC CTAGCTGCCC TAGCCAAGCA 3 600 

GGATATGAAT ATTATCGCTA TTGATGACCA CGCAGAGCGC ATCAATCAGT TTGAGCCAGT 3 660 

TTTGGCGCGT GGAGTGATTG GTGACATCAC AGATGAAGAA T T ATT GAG AT CAGCAGGGAT 372 0 

TGATACCTGC GATACCGTTG TAGTCGCGAC AGGTGAAAAT CTGGAGTCGA GTGTGCTTGC 3 780 

GGTTATGCAC TGTAAGAGTT TGGGGGTACC GACTGTTATT GCTAAGGTCA AAAGTCAGAC 3 840 

CGCTAAGAAA GTGCTAGAAA AGATTGGAGC TGACTCGGTT ATCTCGCCAG AGTATGAAAT 3 900 

GGGGCAGTCT CTAGCACAGA CCATTCTTTT CCATAATAGT GTTGATGTCT TTCAGTTGGA 3 9 60 

TAAAAATGTG TCTATCGTGG AGATGAAAAT TCCTCAGTCT TGGGCAGGTC AAAGTCTGAG 4020 
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TAAATTAGAC CTCCGTGGCA AATACAATCT GAATATTTTG GGTTTCCGAG AGCAGGAAAA 4 080 

TTCCCCATTG GATGTTGAAT TTGGACCAGA TGACCTCTTG AAAGCAGATA CCTATATTTT 4140 

GGCAGTCATC AACAACCAGT ATTTGGATAC CCTAGTAGCA TTGAATTCGT AAAGAGGGAT 4 2 00 

GACCCCTCTT TTTTGATGCC TAAGATGGCA AATAGAGACA GAAGCCCCTT GTCTTCTAGT 42 60 

AAAAGTTCTT CAAAGGCTGG AC TTT AT GGT AAAATAGAAA GAAGTGACAA GAGAGAGTAA 43 20 

TACTCAATGA AAATCAAAGA TCAAACTAGG AAACTAGCTA CGGGCTGCTC AAAACACTGT 4 3 80 

TTTGAGGTTG CAGATAGAAC TGACGAAGTC AGTAACATCT ATACGGCAAG GCGACGTTGA 4440 

CGCGGTTTGA AGAGATTTTC GAAGAGTATA AGAAAAAATC AGTCCCCTAA AGGAGTAGAT 4 5 00 

TATGAAGTTA TTGTCTATCG CAATTTCTAG CTATAATGCA G C AG C CT AT C TTCATTACTG 4 560 

TGTGGAGTCG CTAGTGATTG GTGGTG AG C A AGTTGGGATT TTGATTATCA ATGACGGGTC 4 62 0 

TCAGGATCAG ACTCAGGAAA TCGCTGAGTG TTTAGCTAGC AAGTATCCTA ATATCGTTAG 4 6 80 

AG C C AT CT AT CAGGAAAATA AATGCCATGG CGGTGCGGTC AATCGTGGCT TGGTAGAGGC 4 74 0 

TTCTGGGCGC TATTTTAAAG TAGTTGACAG TGATGACTGG GTGGATCCTC GTGCCTACTT 4 8 00 

GAAAATTCTT GAAACCTTGC AGGAACTTGA GAGCAAAGGT CAAGAGGTGG ATGTCTTTGT 4 8 60 

GACCAATTTT GTCTATGAAA AGGAAGGGCA GTCTCGTAAG AAGAGTATGA GTTACGATTC 4920 

AGTCTTGCCT GTTCGGCAGA TTTTTGGCTG GGACCAGGTC GGAAATTTCT CCAAAGGCCA 4 9 80 

GTATACCATG ATGCACTCGC TGATTTATCG GACAGATTTG TTGCGTGCTA GCCAGTTCTA 5040 

ACTGCCTGAA CATACTTTTT ATGTCGATAA TCTCTTTGTC TTTACGCCCC TTCAGCAGGT 5100 

CAAGACCATG TACTATCTGC CTGTCGATTT CTATCGTTAT TTGATTGGGC GTGAGGACCA 5160 

GTCTGTCAAT GAGCAAGTGA TGATTAAGTG CATTGACCAG CAACTCAAGG TCAATCGACT 52 2 0 

CTTGATAGAC CAACTTGATT TGTCCCAAGT GAGTCATCCC AAAATGCGAG AATATCTGCT 5280 

GAATCATATT GAACTCACGA CGGTGATTTC CAGTACCCTG CTCAACCGAT CTGGAACAGC 53 4 0 

GGAGCATCTG GCAAAAAAAC GCCAATTGTG GACCTATATT C AG C AG AAAA ATCCAGAAGT 54 0 0 

CTTTCAGGCT ATTCGTAAGA CCATGTTGAG CCGTTTGACC AAACATTCTG TCTTGCCAGA 54 60 

TCGCAAACTG TCCAATGTCG TCTATCAAAT CACCAAATCT GTTTATGGAT TTAATTAATA 5520 

TAAGTGTTTT ATAAGAGGGA TTTAAGAAAA ATTTTAACTT TTTCTTAGTC CTTTTTAATT 5580 

TCAGGAGATT ATACTAGAGT CAT C AAAT AA AGAAAGACTC TAAGGAGAAT CCTATGAAAT 5640 

TCAATCCAAA T C AAAGAT AT ACTCGTTGGT CTATTCGCCG TCTCAGTGTC GGTGTTGCCT 57 00 

CAGTTGTTGT GGCTAGTGGC TTCTTTGTCC TAGTTGGTCA GCCAAGTTCT GTACGTGCCG 57 6 0 
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ATGGGCTCAA TCCAACCCCA GGTCAAGTCT TACCTGAAGA G AC AT C G G G A AC G AAAG AG G 582 0 

GTGACTTATC AGAAAAACCA GGAGACACCG TTCTCACTCA AGCGAAACCT GAGGGCGTTA 5880 

CTGGAAATAC GAATTCACTT CCGACACCTA CAGAAAGAAC TGAAGTGAGC GAGGAAACAA 5940 

GCCCTTCTAG TCTGGATACA CTTTTTGAAA AAGATGAAGA AGCTCAAAAA AATCCAGAGC 6000 

TAACAGATGT CTTAAAAGAA ACTGTAGATA CAGCTGATGT GGATGGGACA CAAGCAAGTC 6060 

CAGCAGAAAC TACTCCTGAA CAAGTAAAAG GTGGAGTGAA AGAAAATACA AAAGACAGCA 612 0 

TCGATGTTCC TGCTGCTTAT CTTGAAAAAG CTGAAGGGAA AGGTCCTTTC ACTGCCGGTG 6180 

TAAACCAAGT AATTCCTTAT GAACTATTCG CTGGTGATGG TATGTTAACT CGTCTATTAC 624 0 

TAAAAGCTTC GGATAATGCT CCTTGGTCTG ACAATGGTAC TGCTAAAAAT CCTGCTTTAC 63 00 

CTCCTCTTGA AGGATTAACA AAAGGGAAAT ACTTCTATGA AGT AG AC T T A AATGGCAATA 63 60 

CTGTTGGTAA ACAAGGTCAA GCTTTAATTG ATCAACTTCG CGCTAATGGT ACTCAAACTT 642 0 

AT AAAGC T AC TGTTAAAGTT TACGGAAATA AAGACGGTAA AGCTGACTTG ACTAATCTAG 64 80 

TTGCTACTAA AAATGTAGAC AT C AAC AT C A AT GG ATT AGT TGCTAAAGAA AC AGT TC AAA 6540 

AAGCCGTTGC AGACAACGTT AAAGACAGTA TCGATGTTCC AGCAGCCTAC CTAGAAAAAG 6 600 

CCAAGGGTGA AGGTCCATTC ACAGCAGGTG TCAACCATGT GATTCCATAC GAACTCTTCG 66 60 

CAGGTGATGG CATGTTGACT CGTCTCTTGC TCAAGGCATC TGACAAGGCA CCATGGTCAG 672 0 

AT AACGG CG A CGCTAAAAAC CCAGCCCTAT CTCCACTAGG CGAAAACGTG AAGACCAAAG 67 80 

GTCAATACTT CTATCAAGTA GCCTTGGACG GAAATGTAGC TGGCAAAGAA AAACAAGCGC 6840 

T C AT TG AC C A GTTCCGAGCA AAyGGTACTC AAACTTACAG CGCTACAGTC AATGTCTATG 6900 

GTAACAAAGA CGGTAAACCA GACTTGGACA ACATCGTAGC AACTAAAAAA GTCACTATTA 69 6 0 

ACATAAACGG TTTAATTTCT AAAGAAACAG TTCAAAAAGC CGTTGCAGAC AACGT T AAAG 7 02 0 

AC AGT AT CG A TGTTCCAGCA GCCTACCTAG AAAAAGCCAA GGGTGAAGGT CCATTCACAG 7080 

CAGGTGTCAA CCATGTGATT CCATACGAAC TCTTCGCAGG TGATGGTATG TTGACTCGTC 714 0 

TCTTGCTCAA GGCATCTGAC AAGG C AC CAT GGTCAGATAA CGGTGACGCT AAAAACCCAG 7200 

CCCTATCTCC ACTAGGTGAA AACGTGAAGA CCAAAGGTCA ATACTTCTAT CAATTAGCCT 72 60 

TGGACGGAAA TGTAGCTGGC AAAGAAAAAC AAGCGCTCAT TGACCAGTTC C GAG C AAACG 73 2 0 

GTACTCAAAC TTACAGCGCT ACAGTCAATG TCTATGGTAA CAAAGACGGT AAACCAGACT 73 80 

TGGACAACAT CGTAGCAACT AAAAAAGTCA C TAT T AAC AT AAACGGTTTA ATT T C T AAAG 74 4 0 

AAACAGTTCA AAAAGCCGTT GCAGACAACG TTAAGGACAG TATCGATGTT CCAGCAGCCT 7 5 00 

AC CT AG AAAA GGCCAAGGGT GAAGGTCCAT TCACAGCAGG TGT C AAC CAT GTGATTCCAT 7560 
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ACGAACTCTT CGCAGGTGAT GGCATGTTGA CTCGTCTCTT GCTCAAGGCA TCTGACAAGG 7 62 0 

CACCATGGTC AGATAACGGC GACGCTAAAA ACCCAGCTCT ATCTCCACTA GGTGAAAACG 7 6 80 

TGAAGACCAA AGGTCAATAC TTCTATCAAG TAGCCTTGGA CGGAAATGTA GCTGGCAAAG 7740 

AAAAACAAGC GCTCATTGAC CAGTTCCGAG CAAACGGTAC TCAAACTTAC AG C GCT AC AG 7 8 00 

TCAATGTCTA TGGTAACAAA GACGGTAAAC CAGACTTGGA CAACATCGTA GCAACTAAAA 7 8 60 

AAGT C ACT AT TAAGATAAAT GTTAAAGAAA CATCAGACAC AGCAAATGGT T C ATT AT C AC 7 92 0 

CTTCTAACTC TGGTTCTGGC GTGACTCCGA TGAATCACAA TCATGCTACA GGTACTACAG 7980 

ATAGCATGCC TGCTGACACC ATGACAAGTT CTACCAACAC GATGGCAGGT GAAAACATGG 8040 

CTGCTTCTGC TAACAAGATG TCTGATACGA TGATGTCAGA GGATAAAGCT ATGCTACCAA 8100 

AT ACTGGT G A GACTCAAACA TCAATGGCAA GTATTGGTTT CCTTGGGCTT GCGCTTGCAG 8160 

GTTTACTCGG TGGTCTAGGT TTGAAAAACA AAAAAGAAGA AAACTAATCA GCTAAGGAAA 822 0 

TAAATGATGG ATAGTGGGCT GACTAAGATT AGTTTAACAA CTCAATCAGC AATCAGGACT 82 80 

TTCTTTCAAT AGCAGATTAA AATCATCGTA AAACAATAAA AATAGTGTTA T ACT T AAAG C 8340 

AGTATAGCAC TGTTTTTATC AAAGGAGAGA CAGATGGGAA AGACAATTTT ACTCGTTGAC 8400 

GACGAGGTAG AAAT C AC AG A T ATT CATC AG AGATACTTAA TTCAGGCAGG TTATCAGGTC 8460 

TTGGTAGCCC ATGATGGACT GGAAGCGCTA GAGCTGTTCA AGAAAAAACC GATTGATTTG 852 0 

AT TAT C AC AG ATGTCATGAT GCCTCGGATG GATGGTTATG ATTTAATCAG TGAGGTTCAA 8580 

TACTTATCAC CAGAGCAGCC TTTCCTATTT ATTACTGCTA AGACCAGTGA ACAGGACAAG 8 64 0 

ATTTACGGCC TGAGCTTGGG AGCAGATGAT TTTATTGCTA AGCCTTTTAG CCCACGTGAG 8700 

CTGGTTTTGC GTGTCCACAA TATTTTGCGC CGCCTTCATC GTGGGGGCGA AAC AG AG CTG 87 60 

ATTTCCCTTG GCAATCTAAA AATGAATCAT AGTAGTCATG AAGT T C AAAT AGGAGAAGAA 8 82 0 

ATGCTGGATT TAACTGTTAA ATCATTTGAA TTGCTGTGGA TTTTAGCTAG TAATCCAGAG 8880 

CGAGTTTTCT CCAAGACAGA CCTCTATGAA AAGATCTGGA AAGAAGACTA CGTGGATGAC 894 0 

ACCAATACCT TGAATGTGCA TATCCATGCT CTTCGACAGG AGCTGGCAAA AT AT AGT AG T 9000 

GACCAAACTC C C ACT ATT AA GACAGTTTGG GGGTTGGGAT ATAAGATAGA GAAACCGAGA 9060 

GGACAAACAT GAAACTAAAA AGTTATATTT TGGTTGGATA TATTATTTCA ACCCTCTTAA 912 0 

CCATTTTGGT TGTTTTTTGG GCTGTTCAAA AAATGCTGAT TGCGAAAGGC GAGATTTACT 9180 

TTTTGCTTGG GATGACCATC GTTGCCAGCC TTGTCGGTGC TGGGATTAGT CTCTTTCTCC 9240 

TATTGCCAGT CTTTACGTCG TTGGGCAAAC TCAAGGAGCA TGCCAAGCGG GTAGCGGCCA 9300 
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AGGATTTTCC TTCAAATTTG GAGGTTCAAG GTCCTGTAGA ATTTCAGCAA TT AG GG C AAA 93 60 

CTTTTAATGA GATGTCCCAT GATTTGCAGG TAAGCTTTGA TTCCTTGGAA GAAAGCGAAC 942 0 

GAGAAAAGGG CTTGATGATT GCCCAGTTGT CGCATGATAT TAAGACTCCT ATCACTTCGA 94 80 

TCCAAGCGAC GGTAGAAGGG ATTTTGGATG GGATTATCAA GGAGTCGGAG CAAGCTCATT 9540 

AT C T AG C AAC CATTGGACGC CAGACGGAGA GGCTCAATAA ACTGGTTGAG GAGTTGAATT 9600 

TTTTGACCCT AAACACAGCT AGAAATCAGG TGGAAACTAC CAGTAAAGAC AGTATTTTTC 9 6 60 

TGGACAAGCT CTTAATTGAG TGCATGAGTG AATTTCAGTT TT T G AT TG AG CAGGAGAGAA 97 20 

GAGATGTCCA CTTGCAGGTA ATCCCAGAGT CTGCCCGGAT TGAGGGAGAT TATGCTAAGC 97 80 

TTTCTCGTAT CTTGGTGAAT CTGGTCGATA ACGCTTTTAA ATATTCTGCT CCAGGAACCA 984 0 

AGCTGGAAGT GGTGGCTAAG CTGGAGAAGG ACCAGCTTTC AATCAGTGTG ACCGATGAAG 99 00 

GGCAGGGTAT TGCCCCAGAG GATTTGGAAA ATATTTTCAA ACGCCTTTAT CGTGTCGAAA 99 60 

CTTCGCGTAA CATGAAGACA GGTGGTCATG GATTAGGACT TGCGATTGCG CGTGAATTGG 10020 

CCCATCAATT GGGTGGGGAA ATCACAGTCA GCAGCCAGTA CGGTCTAGGA AGTACCTTTA 10080 

CCCTCGTTCT CAACCTCTCT GGTAGTGAAA ATAAAGCCTA AAACCCCTTT AC AAAT C C AG 1014 0 

CTATTCATGG TAGAATAGAT TTTGTGTGAA ATATCAGCAG G AAAG CAT G A AGCTCGTCAA 10200 

CAGGTGTCTT ATGACAAGTA ACCTTGGCTG TTTAGGCGAA GGGCATCTGC ACGG 102 54 



(2) INFORMATION FOR SEQ ID NO : 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 69 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO : 30: 

CCGGCGACTA TCGATAACAC TTGACTTGGT AGCCCCACAT TTTGGACAAC GCATCCTTTC 60 

CCTCCTTATC GTTTTCTTTT CATTATACCA TTTTTTAAGC G ATT CC C AAA ACAATTCTTC 120 

TTTTTGCTTG ACAAGTTTTT TGTTTTGTTG TATTATTTAA TTAAGACAAC AAGGTAAAAG 18 0 

AAAGGAGACT AAGATGTCCT GGACATTTGA CAACAAAAAA C C C AT C T ATT TACAGATTAT 240 

GGAGAAAATC AAGCTTCAGA TTGTTTCCCA T AC AC T GG AA CCCAATCAAC AACTTCCAAC 300 

CGTGAGGAGC TAGCTAGCGA GGCTGGTGTC AATCCCAATA C C AT CC AAAG AGCCTTATCA 3 60 

GACCTTGAAC GAGAAGGATT TGTCTACAGC AAGCGAACAA CTGGACGATT TGTGACTAAG 42 0 

G AT AAGG AG C TAATCGCCCA GTCACGCAAA CAATTATCAG AAGAAGAATT GGAACACTTC 4 80 
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GTTTCCTCCA TGACCCATTT TGGCTATGAA AAAGAAGAAC TACCAGGCGT AGTCAGTGAT 54 0 

TATATTAAAG GAGTTTAAGC CTATGTCATT ACT AGT AT TT GAAAATGTAT CCAAATCATA 6 00 

TGGAGCAACA CCAGCCCTTG AAAATGTTTC TCTTGACATT CCAGCTGGAA AAATTGTCGG 660 

CCTTCTTGGG CCAAACGGCT CAGGAAAAAC AACCCTGATT AAACTAATTA ATGGCCTCTT 72 0 

ACAACCAGAT CAAGGACGTG TCCTCATCAA CGACATGGAC CCAAGCCCAG CAACCAAGGC 780 

CGTTGTAGCT TATTTGCCTG ATACGACCTA TCTCAATGAG CAAATGAAGG TCAAAGAAGC 840 

CCTAACCTAC TTCAAGACCT TCTATAAAGA TTGTCAGATC TTGAACGCGC CCATCATCTA 900 

CTTGCAGACC TGGGCATTGA TGAAAATAGT CGTCTCAAGA AACTATCAAA AGGAAACAAA 960 

GAAAAGGTTC AACTGATTTT GGTTATGAGC CGTGATGCTC GTCTCTATGT TTTGGACGAA 102 0 

CCCATTGGTG GGGTGGATCC AGCAGCCCGT GCTTATATCC TCAATACCAT TATCAACAAC 108 0 

T ACT C AC C AA CTTCTACCGT TTTGATTTCT ACCCACTTGA TTTCTGATAT CGAGCCAATC 1140 

TTGGATGAAA TTGTCTTCCT AAAAGACGGA AAAGTCGTCC GTCAAGGAAA TGTAGATGAT 12 00 

ATTCGCTACG AGTCAGGTGA AT C C AT T G AC CAACTCTTCC GTCAGaATTT AAGGCCTAAG 12 60 

CAAAGGAGAT TATTTATGTT TTGGAATTTA GTTCGCTACG AATTTAAAAA TGTTAACAAG 13 2 0 

TGGTATTTAG CCCTCTACGC AGCCGTGCTA GTCCTTTCTG CCCTCATCGG AATACAGACA 13 80 

CAAGGCTTTA AAAATCTACC TTACCAAGAA AGTCAGGCTA CTATGCTACT TTTTCTAGCT 144 0 

ACAGTCTTTG GTGGCTTGAT GCTTACACTT GGGATTTCAA CCATTTTCTT GAT TAT T AAA 15 0 0 

CGCTTCAAAG GTAGTGTCTA CGACCGACAA GGCTATCTGA CTTTGACCTT GCCAGTTTCT 15 60 

G AAC AC CAT A T CATC AC AG C CAAACTAATC GGTGCCTTTA TCTGGTCATT GATTAGCACC 162 0 

GCTGTATTGG CTCTAAGTGC TGTTATTATT CTGGCTTTAA CAGCTCCAGA ATGGATTCCT 16 80 

CTTTCTTATG TGATTACATT TGTAGAAACA CATCTCCCTC AGATCTTTCT TACAGGTATA 1740 

TCCTTCCTAC TAAATACTAT TTCAGGAATC CTCTGCATCT ACCTGGCTAT TTCCATTGGA 1800 

CAGCTTTTCA ATGAATACCG TACAGCACTC GCTGTTGCAG TCTACATTGG TATCCAAATC 18 60 

GTCATTGGAT TTATTGAACT TTTCTTCAAT CTTAGTTCTA ATTTCTATGT CAATTCACTG 192 0 

GTAGGACTCA ATGACCATTT CTATATGGGA GCAGGTATAG CCATTGTTGA AGAACTCATA 19 80 

TTCATAGCTA TCTTTTATCT CGGAACCTAC TACATCTTGA GAAATAAGGT TAATTTGCTT 2040 

TAAATAATTT TTACCTAGAT ATGTAACATA CTCATAGAAC AAAAGAGACC AGGCAAAAAG 2100 

TCTTTAAAAT TAGAAAACGC ATAGTATCAG GTGTTGAATA TGTACTGCcC CCCAAAAGTT 2160 

AGATTTTTTC TGTCTAACTT TTGGGGGCAG TTCATAAGAA CCTTGGTAAT ATGCGTTTTT 2 22 0 
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TGTGAGCTGA CTTATTTCCT TTCACTATAT CGCAAAATGA AATAAGAACG GAACGATGGG 22 8 0 

ATTTTGGAAT TCAAATCAAT TTATAAGAAT GTTTTAGAAG TAATATTATC CTATTCCAGA 234 0 

TTCAGTTCAC TATACAATTG AGTTTTCAAG CAACCTGTTT ACATAATGTG TACATAATTA 24 00 

GGTTCGTGAT TCCACCCTTT TCACCTTTAA AAACCTCGCT TTCGCAAGGC TCTTCTATTT 24 6 0 

ATAAGATAAG GCACGTTTAA AGGTTTTCCA AATCCCTAAA TCATCCGTTT GAAGAACGAG 252 0 

ACTAGCATAC ATGCGTCCGA TAAATCCTGT TGCTACCACC GCAAAAATCA CTGTAATAGC 2 580 

AAGTGAAATC CATGCTTCTG CTCCCCCCGC ATAGTCATTA ATCGTTCGAA ACGGCATAAA 2 64 0 

GAAGGTCGAA ATAAAGGGAA TATAAGAACC AATCTTCAAG AGGAGATTGT CACCAGCTGC 270 0 

ACCTAGAGCT GTCACTCCAA AAAAACCACC C AT AAT C AAA AT CAT C AAAG GCGACAAGGC 27 60 

TTTCCCTGAG TCCTCAGGAC GAGAAACCAT AGATCCTAGG AAGGCTGCCA AGACTACGTA 2 82 0 

CATGAAAAGA CTGATCAAAA TAAAGAGCAA GGTATTCAGT GAGATAGCAT CTCCCAAGTG 2880 

ATCCAAAATA C C AG AC TG AG CCAAGAATGG CAAATCTTTA AAGAGCAAAA CGGCAGCCAG 2 94 0 

ACCACCTACA AC AT AG AT C C CAATATGCGT TAAAATCACT AGAAACAGAG CCATCATCCG 3 0 00 

CGCATAGAAA TAGTGACTTG CCCTTATGCT AGAAAAAACG ACTTCCATAA TTTTGGTGCC 3 0 60 

TTTTTCACTG GCAACTTCCT GAGCTGTTAC ACCCGCATAG GT AAT C AG AA T CAT AT AAAG 312 0 

AAAG AAT C C T AAGGCACCTG CTGCAATTGT TTGAATAAAC TTTTTATTTT CCTTGGCTTC 3180 

ATCAATCTTT TCTGTGAATT GAATTGTCTG CGCTAAGCGT TTTTCCTGCT CTTGAGACAA 3240 

GGAAGCAGTT G AAC G AT T AA GCTGATTTTG CAGTTCATTG AGTGTACCTG TAACCTCAAA 3 3 00 

TTTAATTCCA TTTTCAAGCG ATGTTTCGCC ATGATAAACT GCCTTTAGAA CACTATCTTC 3 3 60 

TTGATCAATG GTCAAATAAC CTTTTAATTT TTCTTCTTTA ATTGCTTCTT TGGCACTTGC 34 2 0 

TTCGTCTTTA TAGTCGAAGT TAACACCATT TACATTCTTC AGTCCTTCTG CTACAGATGG 3480 

CACTGTTGTC ACTACTGCCA CTTTATTATT TTTAGCCATA GAAGAACCTT GGAGATGCCC 3 54 0 

AATTCCTACA GAGATTCCTA AAAAGAGGAA CGGCGAAATC ACCATAAAGA AGAAACTCCA 3 6 00 

TGACTCGACA TGTCGAAGAT AGGTTTCCTT G ATT AC AAC C CACATATTTC T CAT AC T TC C 3 6 60 

ACTCCTGATT CTAGTTTAAA GATTTCATCG ATAGTTGGCG CTTGTTGGTC AAATGTTGCG 3 72 0 

ATATATTGAC CTTGAGTCAA GATTGAGAAG AGTTCCCTTC CAGCGCTCTC ATCCTCCAAA 3 7 80 

ATCAATTTCC AACTGCCTTG TTTGGTCAAG CTCACCTGTT TGACATGAGG AAGATTTTCC 3840 

AATTCTTCCT TGCTTCGTTC ACTTGAAACA AAGAGACGCG TTTTCCCGTA TTGATTGCGG 3 900 

AC AT CCTG AA CTGGTCCGTG CAAGACCACA CGGCCATCTC GG AT CATC AG AATATCGTCA 3 960 

CAAAGTTCCT CAACATTGGT CATGACATGG TCAGAAAAGA TAATGGTTGT CCGCGCTCTT 4020 
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TTTCCTGAAA AATGACTTGT TTGAGCAATT CTGTATTAAC TGGGTCCAAT CCACTAAAAG 4080 

GCTCATCCAA GATAATCAGG TCTGGTTCAT GAATCAGAGT AATAATGAGC TGAATCTTCT 4140 

GCTGATTTCC TTTTGACAGA CTCTTGATTT TATCTGTCAG CTTTCCTTTC ACTTCCAACC 42 00 

TCTTCATCCA TTGAGGGAGT TTTTCTTTGA CTTCTTTGGC ATCCATGCCT TTTAGAGTCG 42 60 

CCAAGTAGCG AACTTGTTCA AG AACTGT C A ATTTAGGCAT GAGATGCGTT CTTCAGGCAG 4320 

ATAACCAATC CGAGCATAGG TCTCCTGACG AATATCCTGA CCATCCAGAC CGATTTCTCC 4380 

CTGATATTCT AGGAATTTCA AAATACTATG GAAAATCGTT GTTTTTCCAG CACCATTTTT 4440 

TCCGACTAGT CCCAAAATAC GACCTGGTCG CGCTTGAAAG TCAATACCAA ACAAAACTTG 4500 

CTTGGATCCA AAACTTTTCT CTAGACTTCT TACTTCTAGC ATCTTTCACC TCCGAAATTT 4 560 

CTTGCACTCA TTATACTCCT TTTTGATAGC CTTTACAATG TTTTTTGTCC ATTTTTAGAA 4 620 

GACTATTGCT GTGTAAAATA TGGCCTGGAG CACTTTTATA CTCAATGAAA ATCAAAGAGC 4 680 

AAACTAGGAA GCTAGCCGTA GACTGCTCAA AGTACAGCTT TGAGGTTGCA GATAAAACTG 474 0 

ACGAAGTCgA CTCAAAACAC TGTTTTGAGG TTGTGGATAG AACTGACGAA kCrTAaCTAT 4800 

ATCTACGGCA AGGCGAAcTG ACGTGGTTTG AAGAGATTTT CGAAGAGTAT TAGTGATAAA 4 860 

T C CAT TAT AC AGCAGCAAAC TTAATTTATA CCTTCCGCTC CTCAACTGTC TATTTTTAAT 4 92 0 

CCTGAATTGT TATTTGAGTA ACTCCTTTTT CCTCGTAAAG TTTTCTTCCT CTAAAACTTC 4 980 

TGGAAAAAGG CTAATAGTTT CAGACAACAT TTTTATAAGA AACAAGTTCA TCTGTCATTT 504 0 

CAAGAAGGAG TAATCCTTTA TCTACTAATG GACGGAACAG AATTCAACCG CTTGTCCGAT 5100 

ATGTTTTCTA AGGATTATAT AGTAAAATGA AATAAGAACA GGACAAATTG ATCAGGACAG 5160 

TCAAATTGAT TTCTAACAAT GTTTTAGAAG TAGATGTATA CTATTCTAGT TTCAATCTGC 522 0 

TATATCTATT ATGCACACCC CTATAGGATC TAATGAAAAT CACAACAGGC TCATTCATAG 5280 

ATGGTTACCT AAGCCTAAGG GAACTAAGAA AACG ACT AC C AAGGAAGTCG CATTCATCGA 534 0 

AAAGT AG AT T AACAACTATC CTAAAAAATG CTTGAACTAC AAGTCCCCCA G AG AAG ACT T 54 00 

CTGGATGACT AACTTGAACT TGAAATTTAG CAATAATTAA TTCACTATCT AACTATATTT 54 6 0 

AGTAATTATT TCAGAACTGA TTAATATTAA AATTAACTAA CAATTCAAAG GATTCATACT 552 0 

AG C C AT AAAT TACGTCCATC AGAGAGAGAC TCTTACTACT TTTAGATTTT AGTCTTTCTA 55 80 

GCTTCAGAAT ACATCTAAAC TTTAGGGAAA ATGACTATTC GAAAGCGCGA ATGCCTCAAA 5640 

ATT AT CT CAG ATAAGCTATT CGAAACTTAG AATGCTTTTA AATTTATGGA ATTGCGATTA 57 00 

TTCGAAACCT AG AATG CAT A TAACCTTTAG TTGACAGACC TATTCTAAGT CTCGAAGGGC 57 60 
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TATTTACTTT CTATTCCTTA TCAAAAAAGA CTCATTCCCC CTTTCTCCTC CAAAATATGG 5820 

TATAGTAGAA AT AT AC TAT C TATGAGGAGT TTACATGTCA CAGGATAAAC AAATGAAAGC 5880 

TGTTTCTCCC CTTCTGCAGC GAGTTATCAA TAT CT CAT CG ATTGTCGGTG GGGTTGGGAG 5940 

TTTGATTTTC TGTATTTGGG CTTATCAGGC TGGGATTTTA CAATCCAAGG AAACCCTCTC 600 0 

TGCCTTTATC CAGCAGGCAG GCATCTGGGG TCCACCTCTC TTTATCTTTT TACAGATTTT 6060 

ACAGACTGTC GTCCCTATCA TTCCAGGGGC CTTGACCTCG GTGGCTGGGG TCTTTATCTA 612 0 

CGGGCACATC ATCGGGACTA TCTACAACTA TATCGGCATC GTGATTGGCT GTGCCATTAT 6180 

CTTTTATCTA GTGCGCCTAT ACGGAGCTGC CTTTGTCCAG TCTGTCGTCA GCAAGCGCAC 624 0 

CTACGACAAG TACATCGACT GGCTAGATAA GGGCAATCGT TTTGACCGCT TCTTTATTTT 63 00 

TATGATGATT TGGCCCATTA GCCCAGCTGA CTTTCTCTGT ATGCTGGCTG CCCTGACCAA 63 60 

GATGAGCTTC AAGCGCTACA T G ACC AT CAT CATTCTGACC AAACCCTTTA CCCTCGTGGT 6420 

TT AT AC C T AC GGTCTGACCT ATATTATTGA CTTTTTCTGG CAAATGCTTT G AC AC GT AAA 6480 

AAATCCGTTT GGTTTCCCAA GTGGATTTTT AAAGC GT AG A TTAACTATAG CTTGATACTA 6540 

AATATACTTT GGTATGGAAA T C AT GC AT AT TT T T C GAT AG TGAGGCGAGG ACTTACCTAG 6600 

CCTTTCCGCC GTGATAGAAA CACCTGAAAT CTAATGGTTT CAGGTATTCG GAAACTTTGA 6660 

GCCTAGTGTC TCAAAGTTTA GGTATGGAAT TTTGAAGAAA GTCGCTACCG TCCGTAATCA 672 0 

CT T AAGG AAA GGCTCAAAAA TATTGTTTTC AACCACAAAA TCCGTTTGGT TTCCCAAGCG 67 8 0 

GATTTTGTGC TTTATTTTGA AACTTCTTTT GCAAGAACAA AGTTCCCAAG TGTGGCAGAA 684 0 

CCATTTCCTG CGACTGCTGG CGTCACGATA TAGTCACGCA CATCTGGTAC TGGTAGGTAA 690 0 

CCATTAAGAA GAGATGTAAA TTTCTCACGG ACACGGTCCA GCATATGTTG TTGAGCCATG 6960 

ACCCCTCCAC CAAAGACAAT CACGTCTGGG CGGAAAGTCA CTGTCGCATT AACCGCAGCT 7 02 0 

TGAGCGATAT AGTAGGCTTG AACATCCCAA ACAGGGTTGT TGAGTTCAAT AGTTTCCCCA 7080 

CGTACACCTG TACGAGCTTC CAAACTTGGA CCAGCTGCAT AACCTTCTAG ACATCCCTTA 7140 

TGGAAAGGAC AAACACCCTT AAACTCTTTT TCAATATCCA TTGGGTGTCT AGCAACATAA 72 0 0 

TAATGACCCA TTTCAGGGTG ACCCACACCA CCGATAAACT C ACC AC GT TG GATGACGCCT 72 60 

GCACCGATAC CTGTACCGAT TGTGTAGTAA ACCAAGTTTT CGAT AC G AC C ACCAGCATTG 7 32 0 

TTACGGGCAA CCATTTCACC GT AAG C AG AG CTGTTTACGT CTGTTGTGAA GTACATTGGC 7 3 80 

ACGTTTAGGG CGCGACGAAG GGCACCAAGC AAGTCTACAT TTGCCCAGTT TGGTTTTGGA 74 40 

GTCGTCGTGA TAAAGCCATA AGTTTTTGAG TTTTTGTCAA TATCAATCGG CCCAAATGAA 7 500 

CCAACTGCAA GACCAGCAAG GTTATCGAAT TTTGAGAAGA ACTCAATGGT TTTATCGATT 7 560 



WO 98/18931 



PCT/US97/19588 



323 

GTTTCGATTG GAGTTGTTGT TGGAAATTGT GTTTTTTCTA CAACGTTAAA GTTTTCATCA 7 62 0 

CCGACAGCAC AG AC AAAC TT TGTACCGCCC GCTTCCAAGC TTCCATATAA TTTTGTCATG 7 680 

ATAAACCTCT TGTTTTTATT TTCTTTATTA TAG C AT ACT T CGAAAGTCTA AATGTCTCTA 7 74 0 

TTTTTTAGAT TTTCCTCTGT AAATCTTACT AT CT AAT AAA AACGAACAAA CATGTCATTT 7 8 00 

GTTCGTTTTC ACATTAGAGA GGATTGATTA GATTTTCACT TCGATCACAG CATCCCCCTT 7 8 60 

AGCAACTGAA CCTGTTGCGA CTGGAGCTAC TGAAGCGTAG TCACCTGTAT TTGTAACGAT 792 0 

AACCATTGTT GTATCATCAA GTCCAGCTGC AGCGATTTTG TTTGAGTCAA ATGTTCCAAG 7 9 80 

AACATCGCCA GCTTTCACCT TATTACCTTG AGCAACTTTT GTTTCAAAAC CGTCACCGTT 8040 

CATAGATACA GTATCAATAC CAACATGAAT CAAAACTTCA GCACCATTTC TTGTTTTCAA 8100 

ACCAAAAGCG TGCCCTGTTG GAAAGGCAAT TG AAAC TT C A GCATCAGCTG GTGCATAGAC 816 0 

CACGCCTTGG CTTGGTTTCA C AACG AT AC C TTGTCCCATA GCTCCACTTG AGAAGACTGG 82 2 0 

GTCATTGACA TCAGCAAGAG CGACAACATC ACCGACGATA GGAGTTACAA GTGTTTCATT 82 80 

TTGAAGAGCT GCTGGCGCAA CTTCTTCTTT TTCTTCAGCC ACTTCAGCTC GTTTTGCAGC 8 34 0 

TGCAGTTGCG TCTACTTCAT CTTCGTAACC AAACATGTAA GTAAGAGCAA AACCAAGGGC 84 00 

AAATGATACA GCTACCATAA GAAGGTATTG TGGAAGTTGT CCGTTACCAA CATAAAGCAT 84 6 0 

TGTACCAGGG ATGATGGTGA TACCATTACC AGTACCAGCA AGTCCAAGGA TAGAAGCCAA 8 52 0 

TCCACCACCG ATTGCACCAG CAATCAATGA AAGGAAGAAT GGTTTACGGA AGCGCAAGTT 8580 

CACCCCGAAG ATAGCAGGCT CTGTAATACC TAGGAAGGCA G AAAG AG C AG CCGGGAAAGC 8 64 0 

AAGTGTTTTC AGTTTTGGAT TTTTTGTTTT AACACCAACC GCAACAGTAG CAGCACCTTG 87 00 

AG CTGT CAT A GCAGCTGTGA TGATAGCGTT GAATGGGTTA GCATGGTCAG CAGCAAGTAA 87 60 

TTGCACTTCA AGCAAGTTGA AGATGTGGTG CACACCTGAC AC G AC G ATC A ATTGGTGAAC 8820 

CCCACCAATC AAG AAAC C AC CAAGACCAAA TGGCATGCTA AGAATCGCTT TTGTAGCAAT 8880 

AAGGATGTAG TTTTCAACAA CGTGGAAAAC TGGTCCAATG ACAAAGAGTC CAAGGATAGA 894 0 

CATGACCAAA AGTGTCACGA ATGGTGTTAC CAAGAGGTCA ATGACATCTG GAACAACTTG 9000 

CGG AC AG CTT TTTCAAATTT AGCTCCGACA ACCCCGATGA TGAAGGCTGG AAGAACGGAA 9060 

CCTTGCAAAC CAACAACAGG GATGAAACCA AAGAAGTTCA TCGCTGTTAC TTCACCACCT 912 0 

TGAGCAACTG CCCAAGCGTT TGGAAGTGAG CCAGAGACAA GCATCATACC AAGAACGATA 9180 

CCAACGGCAG G ATT T C C AC C AAATACACGG AAGGTTGACC ACACAACCAA ACCTGGCAAG 9240 

ATGATGAAGG CTGTATCTGT CAAGATTTGT GTGTAAGTTG CAAAGTCACC TGGAAGTGGC 93 00 
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ATTTCAAGAG CGTTGAAAAG ACCACGCACA CCCATGAAGA GACCTGTCGC TACGATAACT 



9360 



GGGATGATTG GAACGAAAAC ATCACCAAAA GTACGGATAG CACGTTGGAA CCAGTTCCCT 



9420 



TGTTTAGCAA CTTCTGCTTT CATGTCATCC T TAG AT GAT G TTGGTAATCC AAGTACAACA 



9480 



ACTTCATCGT ACATTTTGTT AACTGTACCT GTACCAAAGA TAATTTGGTA TTGCCCTGAG 



9540 



TTAAAGAAAG CACCTTGAAC TTTTTCCAAG TTCTCAATCA CTTCTTTATT GATTTTCTCT 



9600 



TCATCTTTGA CCATGACACG TAGACGAGTC GCACAGTGGG CAACACTATT GACATTTTCA 



9660 



CGTCCGCCCA AGGCATCGAT GACTTTTTTT GCAATTTCCT GATTGTTCAT TTGCAAAAAT 



9720 



CTCCTTATAT AACATTTTGT TCTTGTTTGA AAGCGATTTT ATTCGCCGG 



9769 



(2) INFORMATION FOR SEQ ID NO : 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3149 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 31: 

CGCTTGAGTG CTAATTCATA GTTCTATTGT ATCACTTGGT CAGAAATAAT CAAGAAAAAA 60 

GTCTGACTTT CTCAAGATAA AAAGCCTGAG ACCAACTCAG ACTTTTTAAT TCTTAAAATG 12 0 

GCAATTCTTC CTCTTCCAAG ACCAAATCTG CCAAATCTTG GCCTGCATTA TTTTCACGCA 18 0 

TAGCACGTTG GGCACGACTT TCCAAGAGTT GGAATCCTGT GACAAGTACT TCGGTCACGT 24 0 

AGTTCATTTG GCCATTTTTC TCAAAGCGAC GGGTACGCAA TTCTCCATCA ACGGAAATGA 3 00 

GACTACCTTT GGTTGCGTAC TTGCCAAAGT TTCTGCTAGT CTGCCCCATA GGACCATATT 3 60 

G AC AAAAT C A GCTTCACGTT CACCGTTTTG GTCTTTGTAA CGACGGTTCA CAGCGATAGT 420 

TGCTCGCGCT ACCGACTTGT CATTGTTGGT TTTGTGCAAT TCTGGTGTAG ACGTTAAACG 48 0 

TCCAATCAAG ATAACTTTAT TATACATATT TTCTTCCTCC TACTTATCTA TTCGTAGGAA 54 0 

ATCAAAAAAA GTTACAGAAA TTTGTAACTT TTCGAGAAAA TTTTTTATTT TTTATGAACC 600 

ATGAAACCTG TCGCCTGTTG ATTGGCCATA ATGGTCATAT CTGTAATCTG AACACGACGA 660 

GGTTGACTAG TCACATAGAC TACTGTATCT GCAATATCCT GAGCTTGCAA AGCTTCTATT 72 0 

CCTTGGTAAA CGGACGCAGC TCGTTCTTTA TCACCATGAA AACGCACTGT AGAAAAATCT 780 

GTTTCGACAA TTCCAGGCTG AATGGTCGTC ACCTTGATAT CCGTTGCGAT GGTATCAATT 840 

CGCAGTCCAT CTGAAAAGGT CTTAACTGCC GCCTTGGTGG CTGAGTAAAC AGCTGCACCA 900 

GCATAGGCAT AAATTCCTGC GGTTGACCCC ATATTGATAA TATGACCTTG ATTGGCTTTT 960 
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ACCATTGCTG GCAAGAAACA GCGAGTGACT GCCATCAAAC CTTTGACATT GGTATCCAAC 102 0 

ATGGTCAGCA TATCCAACTC TTCATAGTCT TGATAGGGAG CTAAGCCAAG AGCCAGTCCT 1080 

GCGTTATTGA CCAGGATGTC AATCTGACCT ATCGTTTCTA AAAT AT C AG A GCAGACAGTC 1140 

TTTACCATTG TCATATCCGT G AC AT C T AGG AGAAAAGTCC AAACTGTTTG ATTTGGAAAA 12 00 

GTTTCTGCAA ACTCCGCCTT AAGAGCTTCT AGTCTGTCTA TCCGTCGTCC TGTTAGAACG 12 60 

ACATCCTCAC CCTGCTCCAG ATAAGCACGC GCAATCGCTT CACCGATTCC TGATGTCGCT 13 2 0 

CCTGTAATCA CAACATTTTT TGCCATCTTA TTTCCTTCTA GCTGGTCTAT CAGATATTAA 13 80 

CAACTTCTTA GGCAGTCCAG TGTTTCGCTG GGTCGAACGG TGTTCCGACA ACTTGGTCTT 144 0 

CTGATAATTC AAGCACCCCA CGTTTTTGTG GAGCATTTGG CAGATGCAAT TC AC G AGG AC 1500 

TGCACATCAT ACCAAAACTC TTTTCACCAC GAAGTTCACC TGGGAAAATG AGATTCCCTT 1560 

TTGGCATCAT AGCTCCAGGA AGCGCGACAA TGGTTTTCAA CCCCACACGC GCATTGGGAG 162 0 

CTCCTGCAAC GATTTGTACA GTCTTATCAC TTGCGACTGC AACTTGGCAG ATGTTGAGGT 16 80 

GGTCACTATC TGGATGGGCT ACCATCTCAA CAATTTCACC TACAACAAAC TTAGGTTCCT 174 0 

TATCATTAAC AATTTCTTCT GTAAAACCTT CCGCCTGCAA CTCTTGGTTC AAACGAGCGA 18 00 

CTTGCTCATC TGTCAAAAAG ACTTGACCGC GCTCTGCAAT TTCAAATAAA CTTGAAACTT 1860 

CGAAAATATT CCAAGCCACT GTTTCCCCAT TATCTTTGAG AAAAACACGG GCTACCTTGC 192 0 

CTTTGCGCTC CACATCCAGT TTGGCATCTC CGCTATTTTT CACGATGACC ATAAGGACAT 19 8 0 

CACCGACATG TTCTTTATTA TATGTAAAAA TCATTGTTTC CTTTTTCTCC TATTTCAGTC 2040 

CTGCTAAAAA GTCATTGATT TGTTGCTTGC TTTTACGGTC GCGATTGACA AAACGACCGA 2100 

TTTCCTTGTC CTTTTCTAGA ACAACAAGGC TAGGAATTCC GTAAACATCC CAGAGTTTGG 2160 

CCAAATCCAT ATACTGATCT CGGTCCATTC GAATAAAGGT GAACTCTGGA TTGGTCTCCT 2 22 0 

CAATCTCTGG TAAGGCAGGA TAAATATAAC GACAATCGCT ACACCAGTCT GCCACAAAAA 2280 

TGAAGACCTT CTTGCCCGCT TTTTCCACTA AAGATGCTAA TTCTTCTAAA CTTGCTGGCT 2 34 0 

GTATCATAAG ACTTCCTCCT CATAGACTAG GTCTTCATTT T CAT AG AC AA AGGTATAATG 2400 

ACGGCCATCC TCAAAAATGA CGCCACCAAC CAAGCTCTCC AGACTGCTTT CGTAAACTTG 2460 

AACATAAAGG GTCGCAATTT CCCCCATGTC GGAAAAATGG TCTCGCACAA TCTCTGTCAA 2520 

CTCTTCCTGA GTCTTCATGA GCTTACGGTC ATCTGCAACT TTTTTCGTAG CAAGAGCAAG 2580 

GCTTCCGATA C C TAGC AG AG CCAAGCCTGC CATCCACATT TTTTTAGCTT T CAT AC CAT T 2 640 

CATTTTAACA CAAAAAAGGC TTCAGGACAA ATGAGGAAGC AGCAGAAAAG CAAGTAAAAA 2 700 
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GCCTCTTCCT TTAAGGAAAA GGACTTCTTA TACTCAATGA AAATCAAAGA CCAAACTAGG 



2760 



AAGCTAGCCG CAGGCTGCTC AAAGCACTGC TTTGAGGTTG TAGATAGAAC TGACGAgTCa 



2820 



CTCAAAACAC TGTTTTGAGG TTGTGGATGA AGCTGACGTG GTTTGAAGAG ATTTTCGAAG 



2880 



AGTATTATTC TTATTGCCAG GCACCTAAGT TGCCAACGTA GTAACTATCA GGTGTGTAGG 



2940 



TATTGCGAGC ATCTTACCTG ATGAAGCCAG ATAATACTAC TTGCCATTGT CTTTGACCCA 



3000 



ATCATTCGCA ATCATGGAAC CAGAAGAACT TACATAATAC CATTCTCCCT TGTCATAAAC 



3060 



CCAAGTACTG ACTTTCATGG TTCCTGAGCA ATTAAAGGCA AAAAAACTGT CCAATAACAT 



3120 



TCGTTTTTTA AAAGCATTTG ACACTACAT 



3149 



(2) INFORMATION FOR SEQ ID NO : 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10240 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

CCAAAAATTC AACCTTTAAG GGGAGTCCAG AGAGACTCAC AAGGTGTCAG ATAAAAGAAT 60 

GGTGCAATTT T C T AG AGG AG ACTTTTTGAG TGTGCTCTCT TGTGTTGTAC GATTTTAACT 12 0 

GAGGCCTTGC ACTAGCAAGG TCTTTTCTTT ATCTGGTCCC CTTAAAATTT AAGGAGGAAA 180 

AGTTATGAAT CCCACATGTA AGAAGCGTTT GGGTGT C AT T CGGTTGGAAA CCATGAAGGT 240 

GGTTGCACAA GAGGAAATCG CGCCACAATC TTTG AAT TAG TCCTAGAAGG AGAAATGGTT 300 

GAAGCCATGC GAGCAGGCCA ATTTCTTCAT CTGCGTGTAC CGGACGATGC CCATCTCTTA 3 60 

CGTCGTCCTA TTTCAATTTC GTCTATTGAC AAGGCAAACA AGCAGTGTCA CCTCATTTAT 42 0 

CGGATTGACG G AG CTGGGAC TGCAATTTTT TCAACCTTAA GTCAGGGAGA CACTCTTGAT 480 

GTGATGGGGC CTCAGGGAAA TGGTTTTGAC TTGTCTGACC TTGATGAGCA GAATCAGGTT 54 0 

CTCCTTGTTG GTGGTGGGAT TGGTGTTCCA CCCTTGCTTG AGGTGGCCAA GG AAT TG CAT 6 00 

GAACGTGGAG TGAAAGTAGT GACAGTCCTC GGTTTTGCTA ATAAGGATGC TGTTATTTTG 6 60 

AAAACGGAAT TGGCTCAGTA TGGTCAGGTC TTTGTAACGA CAGATGATGG TTCTTATGGC 72 0 

ATCAAGGGAA ATGTTTCCGT TGTTATCAAT GATTTAGACA GTCAGTTTGA TGCTGTTTAC 780 

TCGTGTGGGG CTCCAGGAAT GATGAAGTAT AT C AAT C AAA CCTTTGATGA TCACCCAAGA 84 0 

GCCTATTTAT CTCTGGAATC TCGTATGGCT TGTGGGATGG GAGCTTGCTA TGCCTGTGTT 900 

CTAAAAGTAC CAGAAAACGA GACGGTCAGC CAACGCGTCT GTGAAGATGG TCCTGTTTTC 960 
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CGCACAGGAA CAGTTGTATT ATAAGGAGAA AATTATGACT ACAAATCGAT TACAAGTTTC 102 0 

TCTACCTGGT TTGGATTTGA AAAATCCGAT TATTCCAGCA TCAGGCTGTT TTGGCTTTGG 1080 

ACAAGAGTAT GCCAAGTACT AT GAT T TAG A CCTTTTAGGT TCTATTATGA TCAAGGCGAC 114 0 

AACCCTTGAA CCACGTTTTG GGAATCCAAC TCCAAGAGTG G C AG AG AC G C CTGCTGGTAT 12 00 

GCTCAATGCA ATTGGCTTGC AAAATCCTGG TTTAGAGGTT GTTTTGGCTG AAAAGCTACC 12 60 

TTGGCTGGAA AGAGAATATC CAAATCTTCC TATTATTGCC AATGTAGCTG GTTTTTCAAA 13 2 0 

ACAAGAGTAT GCAGCTGTTT CTCATGGGAT TTCCAAGGCA ACTAATGTAA AAGCTATCGA 13 8 0 

GCTCAATATT TCTTGTCCCA ATGTTGACCA CTGTAATCAT GGACTTTTGA TTGGTCAAGA 14 4 0 

T C C AG AT T TG GCTTATGATG TGGTGAAAGC AGCTGTGGAA GCCTCAGAAG TGCCAGTTTA 1500 

TGTCAAATTA ACCCCGAGTG TGACCGATAT CGTTACTGTC GCAAAAGCTG CAGAAGATGC 1560 

GGGAGCAAGT GGCTTGACCA TGATCAATAC TCTGGTTGGA ATGCGCTTTG ACCTCAAAAC 162 0 

TAGAAAACCA ATCTTGGCCA ATGGAACAGG TGGAATGTCT GGTCCAGCAG TCTTTCCAGT 168 0 

AGCCCTCAAA CTCATCCGCC AAGTTGCCCA AACAACAGAC CTGCCTATCA TTGGAATGGG 1740 

AGGAGTGGAT TCGGCTGAAG CTGCCCTAGA AATGTATCTG GCTGGGGCAT CTGCTATCGG 1800 

AGTTGGAACA GCTAACTTTA CCAATCCTTA TGCCTGCCCT GACATCATCG AAAATTTACC 18 60 

AAAAGTCATG GATAAATACG GTATTAGCAG TCTGGAAGAA CTCCGTCAGG AAGTAAAAGA 192 0 

GTCTCTGAGG TAAACTGCAA TCAATCTGTT CTTGATTTTT TATTAGTTTG TAATATGAAT 1980 

TTAGGAGAAT TTTGGTACAA TAAAATAAAT AAGAACAGAG GAAGAAGGTT AATGAAGAAA 204 0 

GTAAGATTTA TTTTTTTAGC TCTGCTATTT TTCTTAGCTA GTCCAGAGGG TGCAATGGCT 2100 

AGTGATGGTA CTTGGCAAGG AAAACAGTAT CTGAAAGAAG ATGGCAGTCA AGCAGCAAAT 2160 

GAGTGGGTTT TTGATACTCA TTATCAATCT TGGTTCTATA TAAAAGCAGA TGCTAACTAT 222 0 

GCTGAAAATG AATGGCTAAA GCAAGGTGAC GACTATTTTT ACCTCAAATC TGGTGGCTAT 22 80 

ATGGCCAAAT CAGAATGGGT AGAAGACAAG GGAGCCTTTT ATT AT C T T G A CCAAGATGGA 2 34 0 

AAGATGAAAA GAAATGCTTG GGTAGGAACT TCCTATGTTG GTGCAACAGG TGCCAAAGTA 2 4 00 

ATAGAAGACT GGGTCTATGA TTCTCAATAC GATGCTTGGT T TT AT AT C AA AGCAGATGGA 2460 

CAGCACGCAG AGAAAGAATG GCTCCAAATT AAAGGGAAGG ACT ATT AT TT CAAATCCGGT 2 52 0 

GGTTATCTAC TGACAAGTCA GTGGATTAAT CAAGCTTATG TGAATGCTAG TGGTGCCAAA 2580 

GTACAGCAAG GTTGGCTTTT TGACAAACAA TACCAATCTT GGTTTTACAT CAAAGAAAAT 2 64 0 

GGAAACT AT G CTGATAAAGA ATGGATTTTC G AG AATGGT C ACTATTATTA TCTAAAATCC 2 7 00 
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GGTGGyTACA 


TGGCAGCCAA 


TGAATGGATT 


TGGGATAAGG 


AATCTTGGTT 


TTATCTCAAA 


2760 


TyTGATGGGA 


AAATrGCTGA 


AAAAGAATGG 


GTCTACGATT 


CTCATAGTCA 


AGCTTGGTAC 


2820 


TACTTCAAAT 


CCGGTGGTTA 


CATGACAGCC 


AATGAATGGA 


TTTGGGATAA 


GGAATCTTGG 


2880 


TTTTACCTCA 


AATCTGATGG 


GAAAATAGCT 


GAAAAAGAAT 


GGGTCTACGA 


TTCTCATAGT 


2940 


CAAGCTTGGT 


ACTACTTCAA 


ATCTGGTGGC 


TACATGGCGA 


AAAATGAGAC 


AGTAGATGGT 


3000 


TATCAGCTTG 


GAAGCGATGG 


TAAATGGCTT 


GGAGGAAAAA 


CTACAAATGA 


AAATGCTGCT 


3060 


TACTATCAAG 


TAGTGCCTGT 


TACAGCCAAT 


GTTTATGATT 


CAGATGGTGA 


AAAGCTTTCC 


3120 


TATATATCGC 


AAGGTAGTGT 


CGTATGGCTA 


GATAAGGATA 


GAAAAAGTGA 


TGACAAGCGC 


3180 


TTGGCTATTA 


CTATTTCTGG 


TTTGTCAGGC 


TATATGAAAA 


C AG AAGATT T 


ACAAGCGCTA 


3240 


GATGCTAGTA 


AGGACTTTAT 


CCCTTATTAT 


GAGAGTGATG 


GCCACCGTTT 


TTATCACTAT 


3300 


GTGGCTCAGA 


ATGCTAGTAT 


CCCAGTAGCT 


TCTCATCTTT 


CTGATATGGA 


AGTAGGCAAG 


3360 


AAATATTATT 


CGGCAGATGG 


CCTGCATTTT 


GATGGTTTTA 


AGCTTGAGAA 


TCCCTTCCTT 


3420 


TTCAAAGATT 


TAACAGAGGC 


T AC AAAC T AC 


AGTGCTGAAG 


AATTGGATAA 


GGTATTTAGT 


3480 


TTGCTAAACA 


TTAACAATAG 


CCTTTTGGAG 


AACAAGGGCG 


CTACTTTTAA 


GGAAGCCGAA 


3540 


GAACATTACC 


ATATCAATGC 


TCTTTATCTC 


CTTGCCCATA 


GTGCCCTAGA 


AAGTAACTGG 


3600 


GGAAGAAGTA 


AAATTGCCAA 


AGATAAGAAT 


AATTTCTTTG 


G C ATT AC AG C 


CT AT GAT AC G 


3660 


ACCCCTTACC 


TTTCTGCTAA 


GACATTTGAT 


GATGTGGATA 


AGGGAATTTT 


AGGTGCAACC 


3720 


AAGTGGATTA 


AGGAAAATTA 


TATCGATAGG 


GGAAGAACTT 


TCCTTGGAAA 


CAAGGCTTCT 


3780 


GGTATGAATG 


TGGAATATGC 


TTCAGACCCT 


TATTGGGGCG 


AAAAAATTGC 


TAGTGTGATG 


3840 


AT G AAAAT C A 


ATGAGAAGCT 


AGGTGGCAAA 


GATTAGTACT 


ATAAGTGAAT 


ATGATTTGAG 


3900 


TGAATAGTAA 


GTTAAAAATC 


CTGATTTCAA 


GT AAAAT C AG 


GATTTTTTCA 


TGGATGCAAT 


3960 


TTTTTTGGAG 


TCTGGTGTGA 


CGCGGAGGGT 


CTTTTGTCCT 


GTGTAAGTGA 


CAAAGCCGGG 


4020 


TTTTCCACCA 


GTTGGTTTAT 


TGAGTTTTTT 


GACTTCAATC 


ATATCTACCT 


GCACCAGATT 


4 080 


CGACAGGCGC 


CCTTGAGAGA 


AGTAGGCAGC 


TAACTCTGCT 


GCGTCTGTCT 


TGACTGCATC 


4140 


AG ATGGGT C A 


AGATTTCCTG 


AGATGACAAC 


ATGGCTTCCA 


GGAATGTCCT 


TAGCATGGAA 


4200 


CCAAAGTTCC 


TCCTTGCGGG 


CCATTTTAAA 


GGTCAATTCC 


TCATTTTGAA 


GATTGTTTCG 


4260 


TCCGACATAG 


ATGATGGTTT 


TGCCATCGCT 


TGC T AG AT AT 


TGTTCTAGTT 


TTTTGCGTTT 


4320 


CTGGATTTTC 


TCCCGTTGTC 


TTCTGCGGAT 


AAAACCTGTT 


TGAATCAATT 


CTTCACGGAT 


4380 


TTCAGCGATT 


TCTTCCAGTC 


CAGCTTGGTT 


GAGGACGGTT 


T CT AC AC T T T 


C C AG AT AG AG 


4440 


AATAGTGGCT 


TTGGTTTCTT 


CAATCAAATC 


AGTCAAGTAT 


TTGACAGCTT 


CTTTGAGTTT 


4500 



WO 98/18931 



PCT7US97/19588 



329 

CTGATACCGT TTAAAATAGC GTTGGGCATT CTGGTTGGGA GT C AG AG C CT TATCAAGCGC 4 5 60 

AATCATGATA GGTTGGTTGG TATAGTAGTT GTCTAGGATA ACCTGGTCTT GGTCGTTAGG 4 62 0 

CACTTGGTGG AGGAAGGTTG TCAGCAATTC TCCTTTTTGA CGAAATTCTT CAGCGTTGTC 4 68 0 

TGTCGCCAGT AACTCTTTTT CCTGTTTTTT GAGTTTGTGT CGGTTTTTCT GAAGTTCATT 474 0 

TTCAACACGA CGAATCAGTT CACTGGCCTG CTGTTTGACG CGGTCGCGCT C AG C C TT AT C 4800 

CTTATAGTAG GTGTCCAACA AATCAGAAAG ATTTGCAAAA GGCTCTCCCA CCTGATTTGC 4860 

AAAAGGAACT GGACTGAAGG AAGTCTCAGT CAAGCATGGC TTGGTTTCTT GATTGAAAAA 492 0 

ATTTCGGAAA GCGGAAAGTT TTTCACTAAC CAGTATCCTT TCCAATTCAT TTGCCGTATC 4980 

GCGTCCCAGA CCTTGAAAGA GGCTTTGAAG ATTTTTTGCT GTTAGTTCTT GGGTTTGCAG 504 0 

GATTTCAAAG AGCTTTTCAT CCTTGATAGT AAAAGGATTG AGAGATTTTG TACTTGGCGG 5100 

AG CG AT AT AG GTCGATCCTG GAAGTAAGGT GCGGTAGCTA TTTTGTGAAA AGCCGACGTG 5160 

TTTGATAACT TCGAGGATTT TATGACTGCT TTTATCGACC AGTAGAATAT TACTGTGTTT 522 0 

CCCCATAATT TCGATAATCA AGGTAGCCTG GATATGGTCT CCAATCTCGT TTTT AT TGG A 528 0 

AACTGTAATT TCCACAATAC GGTCATTTTC CACTTGCTCA ATCGACTCAA TCAGGGCCCC 53 4 0 

CTGCAAATAC TTTCTCAAAA CCATGATAAA GGTAGAAGGT TGAGCTGGAT TTTCAAAAGT 5400 

CGTTTGGGTC AGCTGAATGC GTCCAAAAAC TGGATGGGCA GAAAGGAGCA GGCGATGGCT 54 60 

TTGGCGATTG CTGCGGATTT GCAAGACCAA CTCTTGTTCA AAAGGCTGAT TGATTTTCTG 552 0 

GATGCGACCA TTCACTAATT CGCTTCGCAA TTCCTCAACT ATGTGGTGTA AAAAAAATCC 55 8 0 

GTCAAATGAC ATCGTTCTCT CCTTGTGATT GTATTCCATA GT AT TAT AT C AAAAAGGTAG 5640 

AATAAAATCA TGGAAATGTG GTATAATAAA GCCAAGTAAA GAGAAACGAG AAGCACATGT 5700 

ATATTGAAAT GGTAGATGAA ACTGGTCAAG TTTCAAAAGA AATGTTGCAA CAAACCCAAG 57 60 

AAATTTTGGA ATTTGCAGCC CAAAAATTAG GAAAAGAAGA CAAGGAGATG GCAGTCACTT 5 82 0 

TTGTGACCAA TGAGCGTAGT CATGAACTTA ATCTGGAGTA CCGTAACACC GACCGTCCGA 5880 

CAGATGTCAT CAGCCTTGAG TATAAACCAG AATTGGAAAT TGCCTTTGAC GAAGAGGATT 5940 

TGCTTGAAAA TTCAGAATTG GCAGAGATGA TGTCTGAGTT T GAT G C CT AT ATTGGGGAAT 6000 

TGTTCATCTC TATCGATAAG GCTCATGAGC AGGCCGAAGA AT AT GGT C AC AGCTTTGAGC 6060 

GTGAGATGGG CTTCTTGGCA GTACACGGCT TTTTACATAT TAACGGCTAT GATCACTACA 612 0 

CTCCGGAAGA AGAAGCGGAG ATGTTCGGTT TACAAGAAGA AATTT TG AC A GCCTATGGAC 6180 

T C AC AAGAC A ATAAACGAAA ATGGAAAAAT CGTGACTTGA TATCCAGTTT AGAATTTGCT 6240 
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TTGACAGGTA 


TTTTTACTGC 


TATCAAGGAA 


GAACGCAATA 


TGCGAAAACA 


CGCAGTGACG 


6300 


GCTCTAGTGG 


TCATCCTTGC 


AGGTTTTGTT 


TTTCAGGTGT 


CACGAATCGA 


ATGGCTCTTT 


6360 


CTCCTATTGA 


GTATTTTCTT 


GGTAGTAGCC 


TTTGAGATTA 


TCAACTCTGC 


TATTGAAAAT 


6420 


GTGGTGGATT 


TGGCCAGTCA 


CTATCACTTT 


TCCATGCTGG 


CTAAAAATGC 


CAAGGATATG 


6480 


GCGGCCGGCG 


CGGTATTAGT 


GGTTTCTCTT 


TTCGCAGCCT 


TAACAGGCGC 


ATTGATTTTT 


6540 


CTCCCACGAA 


TCTGGGATTT 


ATTATTTTAA 


ACAGTAAGAG 


GAAATTATGA 


CTTTT AAATC 


6600 


AGGCTTTGTA 


GCCATTTTAG 


GACGTCCCAA 


TGTTGGGAAG 


TCAACCTTTT 


TAAATCACGT 


6660 


TATGGGGCAA 


AAGATTGCCA 


TCATGAGTGA 


CAAGGCGCAG 


AC AACGCG C A 


ATAAAATCAT 


6720 


GGGAATTTAC 


ACG AC T GAT A 


AGGAGCAAAT 


TGTCTTTATC 


G AC AC AC C AG 


GGATTCACAA 


6780 


GCCTAAAACA 


GCTCTCGGAG 


ATTTCATGGT 


TGAGTCTGCC 


TACAGTACCC 


TTCGCGAAGT 


6840 


GGACACTGTT 


CTTTTCATGG 


TGCCTGCTGA 


TGAAGCGCGT 


GGTAAGGGGG 


AC G AT ATG AT 


6900 


TATCGAGCGT 


CTCAAGGCTG 


CCAAGGTTCC 


TGTGATTTTG 


GTGGTGAATA 


AAATCGATAA 


6960 


GGTCCATCCA 


GACCAGCTCT 


TGTCTCAGAT 


TGATGACTTC 


CGTAATCAAA 


TGGACTTTAA 


7020 


GGAAATTGTT 


CCAATCTCAG 


CCCTTCAGGG 


AAATAACGTG 


TCTCGTCTAG 


TGGATATTTT 


7080 


GAGTGAAAAT 


CTGGATGAAG 


GTTTCCAATA 


TTTCCCGTCT 


G AT C AAATC A 


CAGACCATCC 


7140 


AGAACGTTTC 


TTGGTTTCAG 


AAATGGTTCG 


CGAGAAAGTC 


TTGCACCTAA 


CTCGTGAAGA 


7200 


G ATT C CGC AT 


TCTGTAGCAG 


TAGTTGTTGA 


C T CT AT G AAA 


CGAGACGAAG 


AGACAGACAA 


7260 


GGTTCACATC 


CGTGCAACCA 


TCATGGTCGA 


GCGCGATAGC 


C AAAAAGG G A 


TTATCATCGG 


7320 


TAAAGGTGGC 


GCTATGCTTA 


AGAAAATCGG 


TAGCATGGCC 


CGTCGTGATA 


TCGAACTCAT 


7380 


GCTAGGAGAC 


AAGGTCTTCC 


TAGAAACCTG 


GGTCAAGGTC 


AAGAAAAACT 


GGCGCGATAA 


7440 


AAAGCTAGAT 


TTGGCTGACT 


TTGGCTATAA 


TGAAAGAGAA 


TACTAAGTAG 


AGGTAGGCTC 


7500 


ATGCCTGCTT 


CTTGTTTTTA 


CAGAAGGAGG 


ACTTATGCCT 


GAATTACCTG 


AGGTTGAAAC 


7560 


CGTTTGTCGT 


GGCTTAGAAA 


AATTGATTAT 


AGGAAAGAAG 


ATTTCGAGTA 


TAGAAATTCG 


7620 


CTACCCCAAG 


ATGATTAAGA 


CGGATTTGGA 


AGAGTTTCAA 


AGGGAATTGC 


CTAGTCAGAT 


7680 


TATCGAGTCA 


ATGGGACGTC 


GTGGAAAATA 


TTTGCTTTTT 


TATCTGACAG 


ACAAGGTCTT 


7740 


GATTTCCCAT 


TTGCGGATGG 


AGGGCAAGTA 


TTTTTACTAT 


CCAGACCAAG 


GACCTGAACG 


7800 


CAAGCATGCC 


CATGTTTTCT 


TTCATTTTGA 


AGATGGTGGC 


ACGCTTGTTT 


ATGAGGATGT 


7860 


TCGCAAGTTT 


GGAACCATGG 


AACTCTTGGT 


GCCTGACCTT 


TTAGACGTCT 


ACTTTATTTC 


7920 


TAAAAAATTA 


GGTCCTGAAC 


CAAGCGAACA 


AGACTTTGAT 


TTACAGGTCT 


TTCAATCTGC 


7980 


CCTTGCCAAG 


TCCAAAAAGC 


CTATCAAATC 


CCATCTCCTA 


G AC C AG AC C T 


TGGTAGCTGG 


8040 
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ACTTGGCAAT ATCTATGTGG ATGAGGTTCT CTGGCGAGCT CAGGTTCATC CAGCTAGACC 8100 

T T C C C AG ACT TTGACAGCAG AAGAAGCGAC TGCCATTCAT GACCAGACCA TTGCTGTTTT 8160 

GGGCCAGGCT GTTGAAAAAG GTGGCTCCAC CATTCGGACT TATACCAATG CCTTTGGGGA 82 2 0 

AGATGGAAGC ATGCAGGACT TTCATCAGGT CTATGATAAG ACTGGTCAAG AATGTGTACG 82 80 

CTGTGGTACC AT C AT TG AG A AAATTCAACT AGGCGGACGT GGAACCCACT TTTGTCCAAA 83 4 0 

CTGTCAAAGG AGGGACTGAT GGGAAAAATC ATCGGAATCA CTGGGGGAAT TGCCTCTGGT 84 00 

AAGTCAACTG TGACAAATTT TCTAAGACAG CAAGGCTTTC AAGTAGTGGA TGCCGACGCA 84 60 

GTCGTCCACC AACTACAGAA ACCTGGTGGT CGTCTGTTTG AGGCTCTAGT ACAGCACTTT 852 0 

GGG C AAG AAA TCATTCTTGA AAACGGAGAA CTCAATCGCC CTCTCCTAGC TAGTCTCATC 85 80 

TTTTCAAATC CTGATGAACG AG AATGGT CT AAGCAAATTC AAG GGG AG AT T AT CCGT G AG 8 64 0 

GAACTGGCTA CTTTGAGAGA ACAGTTGGCT CAGACAGAAG AGATTTTCTT CATGGATATT 87 00 

CCCCTACTTT TTGAGCAGGA CTACAGCGAT TGGTTTGCTG AGACTTGGTT GGTCTATGTG 87 6 0 

GACCGAG AT G CCCAAGTGGA ACGCTTAATG AAAAGGGACC AGTTGTCCAA AG AT G AAGCT 8 82 0 

GAGTCTCGTC TGGCAGCCCA GTGGCCTTTA GAAAAAAAGA AAG AT T T G GC CAGCCAGGTT 8880 

CTTGATAATA ATGGCAATCA GAACCAGCTT CTTAATCAAG TGCATATCCT TCTTGAGGGA 8 94 0 

GGTAGGCAAG ATGACAGAGA TTAACTGGAA GGATAATCTG CGCATTGCCT GGTTTGGTAA 900 0 

TTTTCTGACA GGAGCCAGTA TTTCTTTGGT TGTACCTTTT ATGCCCATCT TCGTGGAAAA 9 0 60 

TCTAGGTGTA GGGAGTCAGC AAGTCGCTTT TTATGCAGGC TTAGCAATTT CTGTCTCTGC 912 0 

TATTTCCGCG GCGCTCTTTT CTCCTATTTG GGGTATTCTT GCTGACAAAT ACGGC C G AAA 9180 

ACCCATGATG ATTCGGGCAG GTCTTGCTAT GACTATCACT ATGGGAGGCT TGGCCTTTGT 92 4 0 

CCCAAATATC TATTGGTTAA TCTTTCTTCG TTTACTAAAC GGTGTATTTG CAGGTTTTGT 93 00 

TCCTAATGCA ACGGCACTGA TAG C C AGT C A GGTTCCAAAG G AG AAAT C AG GCTCTGCCTT 93 60 

AGGTACTTTG TCTACAGGCG TAGTTGCAGG TACTCTAACT GGTCCCTTTA TTGGTGGCTT 94 2 0 

TATCGCAGAA TTATTTGGCA TTCGTACAGT TTTCTTACTG GTTGGTAGTT TTCTATTTTT 94 80 

AG C TG C T ATT TTGACTATTT GCTTTATCAA GGAAGATTTT CAACCAGTAG CCAAGGAAAA 954 0 

GGCTATTCCA ACAAAGGAAT TATTTACCTC GGTTAAATAT CCCTATCTTT TGCTCAATCT 9 600 

CTTTTTAACC AGTTTTGTCA TCCAATTTTC AGCTCAATCG ATTGGCCCTA TTTTGGCTCT 9660 

TTATGTACGC GACTTAGGGC AGACAGAGAA TCTTCTTTTT GTCTCTGGTT TGATTGTGTC 972 0 

CAGTATGGGC TTTTCCAGCA TGATGAGTGC AGGAGTCATG GGC AAGCT AG GTGACAAGGT 9780 
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GGGC AAT CAT CGTCTCTTGG TTGTCGCCCA GTTTTATTCA GTCATCATCT ATCTCCTCTG 



9840 



TGCCAATGCC TCTAGCCCCC TTCAACTAGG ACTCTATCGT TTCCTCTTTG GATTGGGAAC 



9900 



CGGTGCCTTG ATTCCCGGGG TTAATGCCCT ACTCAGCAAA ATGACTCCCA AAGCCGGCAT 



9960 



TTCGAGGGTC TTTGCCTTCA ATCAGGTATT CTTTTATCTG GGAGGTGTTG TTGGTCCCAT 



10020 



GGCAGGTTCT GCAGTAGCAG GTCAATTTGG CTACCATGCT GTCTTTTATG CGACAAGCCT 



10080 



TTGTGTTGCC TTTAGTTGTC TCTTTAACCT GATTCAATTT CGAACATTAT TAAAAGTAAA 



10140 



GGAAATCTAG TGCGAGTAAA AATCAATCTC AAATGCTCCT CTTGTGGCAG T AT C AAT T AC 



10200 



CTAACCAGTA AAAATTCAAA AACCCATCCA GACAgATTGA 



10240 



(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13206 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

CGCTTTATCG TGGACGTGGT CAAGCCGAGA ATTTCATCAA GGAGATGAAG GAGGGATTTT 60 

TTGGCGATAA AACGGATAGT TCAACCTTAA TCAAAAACGA AGTTCGTATG ATGATGAGCT 12 0 

GTATCGCCTA CAATCTCTAT CTTTTTCTCA AACATCTAGC TGGAGGTGAC TTCCAAACTT 18 0 

TAACAATCAA ACGCTTCCGC CATCTTTTTC TTCACGTGGT GGGAAAATGT GTTCGAACAG 24 0 

GACGCAAGCA GCTCCTCAAA TTGTCTAGTC TCTATGCCTA TTCCGAATTG TTTTCAGCAC 3 00 

TTTATTCTAG G AT T AGAAAA GTCAACCTGA ATCTTCCTGT TCCTTATGAA CCACCTAGAA 3 60 

GAAAAGCGTC GTTAATGATG CATTAAAGAA CAGTCGAGAT GAAAAAATCG TGTGACGCAC 42 0 

CAAGGGAGGA GTCTGCCCTT TTGAGGAAAT CTAGCGAGGA AAAAC G AT AC TGGAACAGCA 480 

GAAAGTAAAA CTGACCTCAT GAGGAGGAAG AAAGTGGCTC ATGAGGTCAG GGGTTTTGTA 540 

AGTTACATCT AGTTGAGAGA GGTATGAATG ATTTGGGATT AATCATTTCT TGTTTTAAAT 600 

CAGGAGAATA GTAACGATTT TTTCCTTTTT TGACGAACTC TATTCCGTAA CGATCAATCA 6 60 

ATTTAATCAT GTACCTAATA TTAGAATTGT TTATCCCAAA TTTATTTGAA AGCTTCTCTA 72 0 

AGCTATATCC TTGTTTTCTA AGTTCATAGA TCTGAACTTT ATCATCATAA GTTAGTTTCA 780 

TAATAAAAAC ACCCCAAAAG TTAGATTTTT TCTGTCTAAC TTTTGGGGGG CAGTTCATTC 84 0 

AACACCTGAT ACTATGCGTT TTTCTTATTT GAAATACTTT TTACTCAACC TCTTTATACT 900 

CAATGAAAAT CAAAGTGCAA ACT AG AAAG C TAGCCTCAGG CTGCTCAAAA CAGTGTTTTG 9 60 
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AGGTTGCAGA TGGAAGCTGA CGTGGTTTGA AG AG AT T T T C GAAGAGTATT ACTTAATCTT 102 0 

CTTGATACTT TGACTAAGAA TAAATCCTAC AATCATCCCT ACCATATTTT G C AT AAAATT 108 0 

CGGTAGAATT TCTGGGAGGG CTGCTGCCCA GCCATTCATC AAAGCAGAAC CCAAGGCGTA 1140 

GCCTCCTACC ATGGCAATAG TTGCTAAAAT AAGGCCTAAC CACTGACTTT TTCCTTTAAA 1200 

TCCTGCGAAA AATCCCTGCA AGCCATGGTT GACCAAGCTA AAGAACATCC ACTGAGGGTA 12 60 

GCCTGATAAG AGGTCAATCA AGAAACTTGC TAGTCCTCCG ACTACCGCTC CTTCACGACT 13 2 0 

ACCAAAGTAA AAGGCCGCAA AG AAG AC AC C AGCATCTAAA AGAGTTAGAA TTCCTGTAGG 13 80 

TGTTGGGATT TTTAAGAAAT AACCTAGAAC CACAGAAAGG GCGGTTAATA GGGATACAAG 1440 

GGCGATTTTA GTTGTTTTTG TTTGCTTCAT ATTGTCTTAC TCCATACTGA TCTGCTTGTG 1500 

CAATAGCACG ATAAACGAAA GCCTTAGAGC TTTCTACTGC TGGCAAAAGT TTATCACCTT 15 60 

TAACCAGGTG ACTGGCAATG C TAG AG S C AA AGGTACAACs TGCACCAGCA TTTTGGCCTT 1620 

GGATAACTGG ATTTTCTAGG ATAGTAAAGG TCTGTCCATC ATAAAAGACA TCCACAGCCT 1680 

TGTCCTGACT AAGACGATTG CCTCCCTTGA TAATGACTGt GGCGCTCCTA AATCATGCAA 1740 

TTTCTGCGCT GCAGTTTTCA TGTCTTCCAA GGTTTTAATT TCCTGACCGG ATAATAATTC 1800 

TGCTTCTGGG AGATTAGGCG T AAT C AC ACT GACATAAGGG AAAAAGCGAA TCAACTCTTG 18 60 

GCAGAGCTCA CTGACAGCTA CAT C AT GCGT TTCCTTGCAG ACCAAGACAG GATCCAACAC 192 0 

CACAGGTACT CCTGGGCGTT GTTTGATAAA GTCCAAGGCC TTCTCAGCCA CGCTGACAGT 1980 

AGGGAGAAGA CCAATCTTAA TTCCCCCAAA TTCCACATCA CGCAAGCTAT CTAATTCATG 2 04 0 

TTGAAAAATG GT AT CATC AG TTGGAAAGAC TTCAAATCCT TTTTCTGTCA AGGCTGTCAA 2100 

ACAAGTCACT GCTACAAACC CATGCAAGCC GTTCAAGGTA TAGGTAGCCA AATCAGCTGA 2160 

CAGTCCACCA C C AC T AAAAA TATCATTTCC AGAAAGTGCT AAAATACGAT TATTCTTCAT 2220 

AACGAATCTC CTTTAAATAC AAACCATTTG GTGCTGCAGT GGGACCTGCA AGTTGCCTGT 22 8 0 

CCTTCTTCTC CAAGATGAGA TCAATCTGCT CTACTGGCAT GCGGTTGTTA CCGATTTTGA 2340 

GAAGAGTCCC CACCATATTG CGAATCTGTT TATACAAGAA ACCATTTCCT GAAAAGGTAA 2400 

AGGTCAAAAA TTGTCCTGTC T C ATCG ACT A TTAAAC TAG C TTCTGTGATG GTGCGAACCT 24 60 

TATCCTCTAC ACTAGTCCCA GAGGCTGTAA AAC CGGT AAA ATCATGGGTT CCCTCTAGCT 252 0 

TTTTGATTGC AATCTGCATT CGTTCCACAT CGAGTGGGTA GGGAAAGTGG GTGGCATAGT 2 580 

GACGGCGCAT CGGATTTTTG GGACGTCCTC TATCCACAGT AAACTCATAG GTCTTGCTAT 2 64 0 

GCTTGGCATA ACGGC AAT G A AAATCATCTG CCACAAGCTC AATCGAAATC ACATCAATAT 27 00 
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CTTCAGGAGA CTGGGTATCC AAGGCAAAAC GGAGTTTCTC CTCATCCATC TGATAAGGCA 2760 

GGTCAAAATG AATCACCTGT CCCAGGGCAT GAACCCCACT ATCTGTCCTA CCAGCACCGT 2 820 

GAACAGTAAT GGCTTGCCCT TTATTTAATC TGGTCAAGGT TTTTTCAATT TCTTCCTGAA 2 880 

CGCTACGCGC ATGAGGCTGG CGCTGAAAGC CAGCAAAGGC ATAACCATCA TAGGAAATAG 2 940 

TTGC TTT AT A TCTCGTCATA GCCTCTATTT TATCAAGAAA TTAGTCTGTA AACAAGGACC 3 000 

TAAAACAAAT ATTGTATGGG T AT AAAAAT C TCATACTCTT CGAAAATCTC TTCAAACCAC 3060 

GTCAGTTTCC ATCTGCAACC TCAACACACT ATTTTGAGCA ACCTGCGGCT AGCTTTCTAT 312 0 

AGTAGATTGA AATAAGATAT GAACAACTCT ATTAGGAAAG TCAAATTAAT TTCTAGAAAT 3180 

ATTT T AG C AG CTACAGCGTA CTATTCCAAA CTCAATCAAC TATAGTTTGC TCTTTGATTT 32 40 

TCATTGAGTA TCAAAAGAAA AACTTAGGAA TCAATCCTAA GCTCTCTTCT GAAGTAGGTA 3 3 00 

CAT G AC AAAG ATAGAGATTA CAATCAACCA ACCTCCTAAG ATACTAAAGA CCAACATCCC 33 60 

ATTGTGAGTT AGTAAGCCAA TTGCACCTAG AACGAATGGG GTCGTAAAGG CTCCGAAACT 342 0 

ACAGCCTAAT ACAGCAAATG AAGTTGCTTG ATTGAGGAGT TTAGCTGGAA TTCGTTCAGA 3480 

GACAAGTTGA AAGACCGTCG T C AAG ACT AC ACT AT AGG C A AATCCAGCCA GAACACTTCC 3 540 

TGCTACTACC ACCCACAAGG AT G AAG AC AA GGCAATCACG ATTTGCCCCA AGCCAAAGGT 3600 

AATACCAGAC C AGAGGAG C A GTTTCTCTTT AAAGATAGAA AT C AAG AAAG AAAAACTCAC 3 660 

CCCAGCCACA ATCCCGATCA ACTGCATGAT ACTAAGAACA AAACTAGATA ACTGGGCATC 3720 

CCCCAATCCT CTTTCCACCA TCAAACTTGG AATACGGATG GTAATAGCTG TATTGGTACA 37 80 

AACTACAACT GCCGCTTCGA TAGCTAAGGT AAAAAT C AAG CCTTTCATTT CTCGAGTTAA 3 840 

ACGACTTGCT TCCTTCGCTC TTTTCTTGAC TTCTTTCTTT GATTTTCCAT AAGGGACAAA 3 90 0 

GAG C AG AT AA AGGGGCAGCA CCAAAAATCC AGCACTATAG GCTAGAAAGA TAGCTGTCCA 39 60 

ACCAAAGGCC AACAACTGAC CGACGGCCAA GGTAATGAGA GAAGCTCCAA CGACCTCTGC 402 0 

AGAAGCGCGT AGCCCTAACA TCTGAATTCG CCTTTTTCCT TGGTAGCGTT CACTGATAAT 4080 

AGAAATGGCC TTGGCATTGA TCATCCCAAG ACCCAAACCA AAG AG AAG C C GTGTTCCAAA 4140 

GACAAAGGGA TAGGCTTGGT AC C AGAAGGG AGCTGTACCG CTCAATGATA AAATCAGCAA 42 00 

GCCCAAACTA ATCTGTAAGC GCTCAGGAAA TATTTTTTCT AAG AAAC C AT TTAGCAGTAA 42 60 

CAT CATC ATG ATTCCAAAGG AAGGCAAGCT C AC C AAG AG C TCAATTTGTT CCTTAGAATA 4320 

ACCCTGATAA TAGTCAAACA TGGCTGGTAG GGCACTCGAA ATGGAAAAGG AGGTAATCAA 43 8 0 

AACGAGGGAG AG AG C C AAAA TGCTGGCCCG TTCTAAAAAT TGTTTCATGA AATCTCTTTC 4440 

TATATTTCTC TTAATCTTCT ACTTTTTTGA TAGTTATCAA ATAAGCAAGA AAAGAAGAAG 4 500 
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CCTCATTGGT TTGTAGACTC CTTCTTAAAT TCGAAAATGA ATCCCTTGTA TCTTATACTC 4 560 

AATGAAAATC AAAGAGCAAA CTAGGAAGCT AGCCGCAGGT TGTTCAAAAC AGTGTTTTGA 4 62 0 

GGTTGCAGAT GGAAACTGAC GTGGTTTGAA GAGATTTTCG AAGAGTATTA GGATGACTTT 4 680 

CTCTTGATTT GCTTGATAAA GTAGAAAATA AATCCTGCTA CCATATAGGC AACAAAGATA 474 0 

AT C AG AC AC C ACTTAAACAC AAC ATT C C AA CCCTTGTTCA CAT T C AAAAA GAAGTAAGGG 4 8 00 

AAAGG AT T AT CCTTGGCATT TGGAATATTG AGTTTTAGAA CCAAGCCATT AAAAAGAGCA 4 8 60 

AACATCATAT ACAGAAAGGG TAAAATGGTC CACACTGCTG GATCCCAAAT CTTGTATTGA 4 92 0 

CCCTGTTTGT CAAAAAAGAG GGTATCCGCT AAAAACCAGA TGGGAACGAT ATAGTGGCAA 4 980 

AGGAAATTTT CTAGGGTATA GAAATTAGTC GCAATGGGCG CCAAGAGGAA ATGGTAAATC 504 0 

AC AC AG GT AA T C ATG AT ACT CATGGTGACC CCACCTTTTA AGCGCAAGAG ACTTGGCCTT 5100 

TGCCAATTTT CACCTACACG GCTCATAACC TTTAGAAGAT AAAGGGTAAA AATAGTTACC 5160 

AAGAGGTTGG ACAGAACCGT GTAATAGAGA AGCATCCCAA AAC C AC C ATG CTTAGTAATT 522 0 

TCAAGATAAA CTCCCGTAAA AG C CG C TAG A AACAAGAAGA TACGGCTATA AAATACAAGT 52 8 0 

TTATAGTGTT TTGACATGCT TAAATCTTCC TCACAAACTC TGATTTAAGT TTCATGGCAC 53 4 0 

C AAAAC CAT C AATCTTACAG TCGATATTGT GGTCGCCTTC TACGATGCGG ATATTTTTCA 54 00 

CGCGCGTCCC TTGTTTCAAA TCTTTTGGCG CACCTTTTAC TTTCAAGTCC TTGATGAGAG 54 60 

TTACTGTATC ACCATCAGCC AATTTATTTC CGTTGGCATC GATAGCGACA AGACCTTCTT 5 52 0 

CTACTTCTGC AACTTCAGCA GGATTCCACT CATGAGCACA CTCTGGGCAA AC CAGT AG GG 5580 

CACCGTCTTC GTAGACATAC TCTGAGTTAC ATTTTGGACA ATTTGGTAAA TTGTTCATGG 5 640 

TTTCTCCTTA TCATCATTCA CTATTCTTTG AAAATCAAAA TTTCTCGAAC AGCAACTATT 57 00 

AT AC C C T AAA ATCAGCATTT TGACAAATTT AGAAAAAAAC CGATATCAAT CTATCGGCTT 57 60 

T T C T AC AT TT ACATTCTTTT TTCAGCTTCT GCTTTGATTT TTTCAACTAC TTCTTGAATG 5 82 0 

TTCAAACCAG TTGTATCAAG GTAGACAGCA TCCTCTGCTT GTTTGAGAGG AGAAGTCTCA 5 8 80 

CGATGACTAT CCTTGTAGTC ACGCGCAGCA ATTTCCTTTT TTAGGGTTTC AAGGTCTGTT 5 94 0 

TCAATTCCCT TGGCAATATT TTCCTTGTAA CGACGCTCTG CTCTCTCATC AACAGAAGCT 6000 

ACTAGGAAAA TTTTCAATTC TGCTTGTGGC AATACAACAG TTCCAATATC GCGACCATCC 6060 

ATGACAATCC CGCCTTGCTG GGCAATTTCT TGTTGGAGAG AAACCAGTTT CTCACGCACT 6120 

TGAGGAATTG CTGCAATAGC AGAAACATGA TTGGTCACTT CATTTTCACG GATAGGATGG 6180 

GTAATATCCA CATCTCCTAC AAAAACAAGC TGGTCTCCAG TTTCTGAACG TCCAAAGCTG 6240 



WO 98/18931 



PCT/US97/19588 



336 

ATTGGATGCT GGTCCAACAA GGCTAGAAGG GCTTCGACTT CTTCAACTCC TAATTGGTTC 6 300 

TTAAGAGCCA TATAGGTCGC TGCACGATAC ATAGCTCCTG TATCAAGGTA GGTGAATCCA 63 60 

AAATCCTTAG CAATAATCTT TGCGACCGTA CTCTTACCGC TGGAAGCAGG ACCATCAATA 642 0 

GCAATTTGAA TTGTTTTCAT ATCGGCTCCT ATTTTATTTT TATAACATCA CCTGGATTAG 64 80 

CAAACCAAGA TCCTGTAGCC ATGTGCCCAG GATTCAAGGC CTCTAACTGA GCAATGGAGA 6540 

TTCCTGCACG AGCGGCAATA GCTGCTTCCC CTTCTCCTGC GAGAACTTTA ATCGTTCCTT 6600 

CAGGATTAGC AGCTTCTTCT GAACTACTAG AAGTAGATTC TGGCTCTGAA CTCTGCTCAG 6660 

GCTGAGAACT ACTTGAAGAT GAGATTTGTA CTACACTGGC AT C AG AAT C A TGAAAGCCTT 67 2 0 

TTAAGGCTGC TGTGCGATTA CTCCCCCCCG ATGATAGATA GATGAGAACG ATGACCATCA 67 80 

CCACCACAAT TACAAAGAAA ATACTAGCTA GGATCGTCAA AATACGATTA GCCATCCTAT 6840 

CAGCCCCTCC GTGGTTTCGA TGCCGACGCT CTGCTCTTGA TTCTTCTTGA T CAT AG AT AT 69 00 

CTTCTTGCCA CGGTTCTTTT GCCATACCTT ACTCCTTGTT TTTTTTTACT TTTCTTATTA 69 60 

CAATATAAAT ATGAACATGA AAATCACACT TATACCTGAA CGATGTATCG CCTGTGGGCT 7 02 0 

TTGCCAAACT TATTCTGATT TATTTGATTA CCACGATAAT GGAATCGTGC GTTTTTACGA 7080 

TGACCCTGAC CAACTGGAAA AAGAAATTTC TCCTAGTCAG GATATCTTAG AGGCTGTTAA 714 0 

AAATTGCCCA ACTCGCGCCC TGATTGGAAA CCAGGAAGCC TAAATCAATG GCGATAATCC 72 00 

ACTCCCTCTA GTTTAGCACA TTTCCATGTA AAATT AT AG T CTTTTCACTT TATTTTTTTC 72 60 

TGTAAAATCA GGAAGGTCAC TTTTTTCTTT GATAAGATAA AGTGGTCTTT TTTTAGTCTC 732 0 

TAAATAAATC TTACTGATAT ACT TG C C GAG AATCCCAATG GTCAAGAGTT GAATGCCTCC 7 3 80 

AAGAAAGAGA ATAACAGCCA TCAGAGAGGT CCAACCAGAT GTCGGATTGC C C AAAAT GAG 744 0 

GGTCCGAACC ACAACAAAAA AGGTCATCAG CAGAGAAAGA AAACAAGATA GGAGACCAGC 7 500 

TACAAAGGCT ATAATCAAGG GAAAATCTGA AAAATTAATA ATCCCTTCAA TGGAGTAGAA 7560 

AAAGAGTTGC CTAAAACTCC AACTTGTCTT GCCAGCCTGC CTTTCGACAT TTGGATAGTC 7 62 0 

CAAATAGTAG GTTTTGAAAC CCACCCAGGC GAAGAGCCCC TTTGAAAAAC GATTGGACTC 7 680 

GGTCAAGCTT AAAATGGCAT CG AC T AC AG A CCTTCTCATC ATACGAAAAT CACGGACACC 774 0 

CGACGGCAGA GCTACTGGGC TGATTTTTTG CATGAGGCGA TAAAAGAGAA CAGCACAGAA 7800 

ACTGCGAAAG AAGGGTTCTC CCTCCCGACT AGTTCTCCGT GTCCCAACGC AGTCCAAGTC 7 860 

TACATTTTTG TCTAATACAT TTTTCATCTC AAACAACATA CTAGGAGGAT CTTGGAGGTC 792 0 

TGCATCCATC ACCACCACCA AATCTCCTGT CGCATATTGC AAGCCTGCAT AAAGGGCTGC 7 9 80 

TTCTTTGCCA AAATTTCGAG AGAAAGAAAT ATAATGGACT GCCGGATTTT GCTCCCGATA 8 04 0 
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GGCCTTTAAG 


AGTTCCAAGG 


TCCCATCACT 


TGATCCATCA 


TCGACAAAGA 


CATACTCGAT 


8100 


TTCTGTTTCC 


AAATCTGGAA 


GTAAAGCTTC 


C AG AG CC T G A 


TAAAAAAGAG 


GAAGTACTTC 


8160 


CTCTTCGTTT 


AAACAAGGGA 


CGATGATTGA 


AAT CAT CAT C 


TTAGTCTTCA 


AATCCATTTG 


8220 


GATGCTTGCT 


TTGCCAACGC 


CATGCGTCTT 


CACACATTTG 


GGTGATGTCG 


AGTTCTGCTT 


8280 


CCCAACCGAG 


TTCTGCTTTA 


GCTTTTGCCG 


GGTCTGAGTA 


GCAGGCAGCG 


ATATCACCTG 


8340 


GGCGACGTTC 


TACGATGCGG 


TAAGGAATAG 


GACGGCCCAC 


CGCTTTTTCC 


ATGTTTTGGA 


8400 


TAATTTCAAG 


AACTGAGTAA 


CCTTTACCAG 


TTCCAAGGTT 


ATAAACGTTT 


AGTCCTGAAC 


8460 


CTTTTTGGAT 


TTTTTTCAAA 


GCTGCAACGT 


GACCCTTAGC 


CAAATCGACA 


ACGTGGATAT 


8520 


AGTCACGAAC 


ACCTGTTCCA 


TCTTCCGTAT 


CGTAATCGTC 


T C C AAAC ACT 


TGCACTTGCT 


8580 


CTAATTTTCC 


AACGGCTACT 


TGAGTCACAT 


ATGGCAAGAG 


ATTGTTTGGA 


ATACCGTTTG 


8640 


GATTTTCTCC 


CAAATCACCA 


CTCTCATGGG 


CTCCGATTGG 


GTTAAAGTAA 


CGAAGCAAGA 


8700 


CAACATTCCA 


TTCTGAGTCT 


GCTTTGTAAA 


TATCAGTCAA 


AATTTCCTCT 


AGCATGAGCT 


8760 


TAGTACGACC 


GTATGGGTTG 


GTCACTGAAA 


GTGGGAAATC 


TTCCAAGATG 


GGCACTGTGT 


8820 


GCGGATCCCC 


GTAAACTGTC 


GCAGAAGAAC 


TGAAGATGAT 


GTTTTTACAG 


TTGTTTTCTT 


8880 


CCATGGCTTT 


CAAAAGGCTG 


ACAGTTCCAG 


CGATATTGTT 


GTCATAGTAG 


GCAAGAGGGA 


8940 


TACGTGTTGA 


TTCGCCAACA 


GCCTTCAAAC 


CAGCAAAGTG 


AATGACACCA 


GTCGGTTCTT 


9000 


CCTGCTTGAA 


AATATCTCTG 


AGGGTATCTG 


TGTCACGAAT 


ATCTGCCTCA 


TAGAAAGGAA 


9060 


TCTCAACTCC 


TGTGATTCCT 


TCAACAACTT 


CTAAACTCTT 


ACGATTGCTA 


TTGACAAGAT 


9120 


TATCCACCAC 


AACAACTTGA 


TGACCTGCTT 


GGATCAATTC 


AATAACAGTG 


TGGGTTCCAA 


9180 


TAAAACCGGC 


ACCACCAGTT 


ACCAAAATCT 


TTTCTTGCAT 


CTTTTTTCCT 


CGATTCTCAG 


9240 


ATTATTTTTT 


CTTATTTTAC 


CATTTTTGAC 


AGGGAATGTC 


ATTTGCCATC 


CTAAACTACC 


9300 


TGATAAAATT 


TCAGTAAAAT 


GCTTATACTC 


TTCGAAAATC 


CAATTCAAAC 


TACGTCAACG 


9360 


TCGCCTTGCC 


ATGGGTATGG 


TTACTGACTT 


CGTCAGTTCT 


ATCCACAACC 


TCAAAACAGT 


9420 


GTTTTGAGCT 


GACTTCGTCA 


GTTCTATCCA 


CAACCTCAAA 


GCAGTGCTTT 


GAGTAACCCG 


9480 


CGGCTAGTTT 


CCTAGTTTGT 


TCTTTGATTT 


TTATTGAGTA 


TTATTCGCTT 


TTTACTCGTT 


9540 


TGACATAGTT 


TTCAATTGGG 


TAATTTAGAG 


GGTCCAAGGT 


CAACTCCTTG 


TCTTGGATCA 


9600 


GTTGGGCTAG 


ATGGTAACCA 


ATGATAGGAC 


CAGTTGTGAG 


GCCTGATGAA 


CCTAGTCCAC 


9660 


TGGCTGCATA 


GACACCAGTT 


AAGTCAGGCA 


CCTGCCCAAA 


GAAAGGAGAG 


AAATCACTGG 


9720 


TGTAGGCACG 


GATTCCAACA 


CGCTCAGATT 


TTGAAGTAGC 


TTCAGCCAAA 


ATCAGATAGT 


9780 
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GAGTCAAGGT GGCCTCCTCC ATTTGTTGGA GCAAGGTTTC ATCTACCGTC AAATCAAATC 984 0 

CC AT GT CAT T TTCGTGGGTA GCGCCTAAGG ATAATTTCCC ACCTGCAAAG GGAATCAAAT 9900 

CCCACTCCCC TTCTGGCATG ACAACAGGGT AATCTTCCAT GTCTTGGGCA AG CTG AT AAT 9960 

CTCGTAGTTG TCCTTTTTGA GGACGGACAT CCACTTCATA ACCTAAAGGC TCTAACATGT 1002 0 

CCCCCAACCA AGCTCCCGTC GCCAAAATAA CCTGCTCAAA CTCCTCTTCA CCAATCTGGT 1008 0 

AGCCTGATGC TAACGGTGTC AGAGTCACTT TTTCTTTGAC CAGCTTGACA TGACTGACTT 1014 0 

CCAGCAAACG AGT C ACT AAA AGTTGGCCAT CTACTCTCGC TCCACCAGAA GCATAGAGCA 102 00 

GGCGGTCAAA TCCCTGCAAA CCAGGGAATA ATT C ATT AG C TGAGGCTTGG TTCAGAATGG 10260 

CTAATTGCCC TATCAAGGGA GATTCTTCTC TGCGCTGGAG GGCCAGTTGA TAAAGTTCTT 1032 0 

CCAAATTGGA TTCATCCTTT TTCAAGAGAA AGACTCCCGA ACGCTGGTAA AAGTCGATTT 103 80 

CTTGTCCTGA TTTCTCTAAA TCAGCTAATA AATCCACATA AAAAT C AG C C CCCAAGCGCG 10440 

CCATCTTGTA CCAGGCTTTA TTACGGCGTT TGGAAAACCA AGGACTGATA ATTCCTGCTG 10500 

CGGCCTTGGT GGCTTGACCT TGCTCATGGT CAAAAACGGT CACCTCTAGG TCACTTTCTC 105 60 

TCGAGAGGTA GTAGGCAGCT GTTGCTCCCA CAATTCCTGC TCCAATAATG GCAACTTTTT 10 62 0 

TCATTGTCTT CACTTTCTAA CTAGATATGA TGGAAAGGAT TGGTTGATGC CTGACTAGGC 10680 

AAGATATCAA TAGACCACCC CTTATCTTCC TTCCATTGAC TAAGAAGTGC TGCGATTTTT 10740 

TCTACAAAAA TCACTTCGAT AT AGT G AC C T GGGTCCAATG CAAGCAACCC AT C AG AT AG C 10800 

ATATCCTGAG CAGTATGGTA GTAGATATCA C C AGT GAT AT AG AC AT C TG C CCCCTTTGCC 10860 

AAAGCATCCT TATAGAAAGA CTGCCCGCTT CCACCACAAA TTGCTACTCT TGAAATAGGC 10 92 0 

TTCTGCAAAT CATCCTCTTG AT AATG C AC C ATTCGAAGGC TATCTAGGTC AAAGACTTGC 109 80 

TTGACCTGTT GGGCCAATTC CCAAAATGTC TGAGGCTGAA TAT T C CC AAT ACGTCCAATT 11040 

CCACGTTCTG GACCTGTTTC CTGCAGATAA GTCGTCTCCT CGATTCCTAG CATCTGACAA 11100 

AACCAGTCAT TGAGCCCATT TTCAACGATA TCAATATTGG TATGGCTGAC ATAAACTGCG 11160 

ATATCATGCT TAATCAGGTC GATGTAAATC TGATTTTGCG GACGGCTGGC AAGCAAGTCC 112 2 0 

TTGATAGGAC GAAAGATAGG CGCGTGCTTG ACG AT AAT C A AGTCCACACC CTTTTCAATG 112 80 

GCCTCTGCCA CTGTCTCTTC ACG AAT AT CG AGGGCAACCA TGACCCTTTG GATACCCTTG 1134 0 

TCTAAAGTGC CAATTTGCAG ACCACGGCTG TCTCCCTCCA TAG AAAAT T C CTGAGGGCAA 114 00 

AAGGCTTCAT AAGCTTGGAT CACTTCACTT GCTAACATGG AGCACCTCCT TGATAGCTTG 11460 

AATCTTATCT AC T AG AACTT GACGTTCTTC CAGATTTTTT TCTGGGATTT GTCCGAGGGC 1152 0 

GAACTCTAGC TTCTCAGCTT CTTTTTGCCA TTTTTGGACA AAT ACT GG AC TGACTTCTTT 11580 
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GGACAAGAAG GGACCAAAGC GAACATCACT GGCTGATAGC TTCATTTGTC CTGCTTCCAC 11640 

CACCAAAATC TCATAAAACT TTCCAGCTTC TTCTAAGATG CTTTCTGCTA CAATCTGGAA 117 00 

TCCATGATCC TGTAGCCAGA TACGCAAGTC GTCTTCACGA TTATTGGGCT GGAGGATCAA 117 60 

ACGCTCTACA TTAGCTAACT TCCCCAAACC TTCTTCTAAA AT C CT AG C AA TCAAACGACC 11820 

ACCCATGCCA GCAATGGTAA TGACAGACAC TTGGTCAGTC TCTTCAAAAG CTGCCAAGCC 11880 

ATTGGCTAAA CGGACTTGGA TTTTCTCCTT TAGGCCGTGA GCCTCAACAT TTTTAACCGC 11940 

AGACTGATAG GGACCTTCCA CCACCTCACC TGCAATAGCG CTTTTGATTT GGCCTCTCTC 12 000 

AACCAACTCG ATAGGCAGAT AAGCATGGTC ACTTCCCACA TCTAGTAAAA TAGCCCCCTG 12 0 60 

TGACACAAAG GAAGCTACCA ATTCTAATCT CTTTGAAATC ATCTTCTCTC ACTTTCCAAA 1212 0 

ACTCTATTAC CTCTTATTAT ACCACATTTC AATCTTCAAC TTCCCAGTAA TATAAGCACC 1218 0 

TCTGGCGAAA GAAGTTTCAA TGTCCTAAAG TAATAAGTGA ATCCAATTGA AAGATTTTAA 12240 

ACAATTTGCA AAAATGT C AA AAAATAAAAA ATAAACAGTT TAT T C AG AAA ATTCTTGACA 123 00 

TATAAAAACA CAT GGT AG AA TATAATTAGA AAGT T AG AAA AAATAAAAGT TTGACTAAAA 123 60 

TTTGTATTTG AAGGTGGTGT T C AG AT AAG A AATTTAGTCA GACGAACCAC GAATTTGCTC 12 42 0 

TATGCTTTCT GGAATTTATC ATAACAGGAG GATACAGTCA TGGAACAAAC ATTGTTTGAA 124 80 

TTAGAACTAC TTCCAGAGGA AGATATCATT GTCACAGGTC TCCCTAAGTA TTGTTCTTTT 12 54 0 

ACTTGTTTAA TTACAGGTCG CTAGTTATAT TTTATATAAA ATAAGTAGCT TTACTTACGG 12 600 

AATAGGCTAG TGCTGTGTCT CTAGCCTATT TTAATAATTA GGAGTTTGTT ATGGATTTAT 12 660 

TAGAGAAAGA ATGTTTAAAA TGTGATAAAA ATTTCCAACA GGGTGATATT TGGAATTACT 127 2 0 

ATTATTTATC AGATAAGATG CCTGCACAAG GGTGGAAAAT ACACATAAGC TCCCAAATAA 127 80 

AAGACGCTGT AAATATTTTT AAGATTGTGT AT AAAC T AT C CCAACTAAAT AATTGTAGCT 12 84 0 

TTAAAGTTGT T AAAAATT T A GAGGAATTAA AAAAAATTAA TTCCCCTAGG GAAATGAGCC 12 900 

CTACTGCTAA CAAATTTATA ACTCTATATC CT AAGT C AG A ATCTGAAGCT AAGAGTATGA 12 9 60 

TTTGTAATCT TACGAATAGA CTGTCAGAAT TTAAGGCTCC AAAAATACTA TCTGACTATC 13 020 

AATGTGGAAT GCATTCTCCA GTTCATTATA GATATGGGGC TTTTTTAAAA AAACAAGCTT 13 080 

ATGATGAAAA AAATAAAAAA GTCATCTATT TATTGCTAGA TGAAAAAAGG AAG AACT AT G 1314 0 

TAGAAGATAA GAGACAAAAT TTCCCTAGTC TTCCTAGCTG GAAAATGGAT TTATTTTCAG 132 00 

AAGAAG 132 06 
(2) INFORMATION FOR SEQ ID NO: 34: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13104 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 34: 

CCGG AT C C AG CGAAAAATAT GCTCTTTGAT GCTGTAAGTG GTCAAAAAGA TGCTAAAACA 60 

GCTGCTAACG ATGCTGTAAC ATTGATCAAA GAAACAATCA AACAAAAATT TGGTGAATAA 12 0 

AAAATTTGTT CAAGGGGGGT GG AAAT C AAA TCCCCCTTTG AATTTATCAA TAGAGACACA 180 

AATAATTTAG CTTTCTTATA AAAAAGTAGT ATCCTATGAA AGGAGTTAAT ATGGAAAAGC 24 0 

AACAACCTAG TAAAGCAGCC CTGCTGTCTA TCATTCCTGG GTTAGGACAG ATTTACAATA 3 00 

AACAAAAAGC CAAAGGTTTT ATCTTCCTTG GTGTAACCAT CGTATTTGTC CTTTACTTCC 3 60 

TAGCACTTGC AACCCCTGAA TTGAGCAACC TCATCACTCT TGGTGACAAA CCAGGTCGTG 42 0 

ATAATTCCCT CTTTATGCTG ATTCGTGGTG CCTTCCATCT AATCTTTGTA ATCGTTTATG 4 80 

TACTCTTTTA TTTCTCAAAT ATCAAAGATG CACATACGAT TGCAAAACGC ATTAACAATG 54 0 

GAATTCCAGT TCCACGCACA CTCAAAGACA TGATCAAAGG GATTTATGAA AATGGCTTCC 600 

CTTACCTCTT GATCATTCCA TCTTATGTTG CCATGACCTT CGCGATTATC TTCCCAGTTA 6 60 

TCGTAACCTT GATGATCGCC TTTACCAACT ACGACTTCCA ACACTTGCCA CCAAACAAGT 720 

TGTTGGACTG GGTTGGTTTG ACCAACTTTA CAAACATTTG GAGCTTGAGT ACCTTCCGTT 7 80 

CTGCCTTTGG TTCTGTTCTT TCTTGGACTA TCATTTGGGC TTTGGCAGCT TCTACTTTAC 84 0 

AAATCGTAAT TGGTATCTTC AC AG C T AT C A TTGCCAACCA ACCATTTATC AAAGGAAAAC 9 00 

GTATCTTTGG TGTTATTTTC CTTCTTCCTT GGGCTGTCCC AGCCTTCATC ACTATCTTGA 960 

CATTCTCAAA CATGTTTAAC GATAGTGTCG GTGCTATCAA CACTCAAGTA TTGCCAATCT 1020 

TGGCTAAATT CCTTCCTTTC CTTGATGGAG CTCTTATTCC TTGGAAAACA GACCCAACTT 108 0 

GGACTAAGAT TGCCTTGATT ATGATGCAAG GTTGGCTCGG ATT C CC AT AC ATCTACGTTC 114 0 

TGACCTTGGG TATCTTGCAA TCTATTCCTA ACGACCTTTA CGAAGCAGCT TAT AT TG AC G 12 00 

GTGCCAACGC TTGGCAAAAA TTCCGCAACA TCACTTTCCC AATGATTTTG GCTGTTGCGG 12 60 

CACCTACTTT GATTAGCCAA TACACCTTCA ACTTTAACAA CTTCTCTATC ATGTACCTCT 132 0 

TCAATGGTGG AGGACCTGGT AGTGTCGGAG GTGGAGCTGG TTCAACCGAT ATCTTGATCT 13 8 0 

CATGGATCTA CCGTTTGACA ACAGGTACAT CTCCTCAATA CTCAATGGCG GCAGCTGTTA 1440 

CCTTGATTAT CTCTATCATT GTCATCTCAA TCTCTATGAT CGCATTCAAG AAACTACACG 1500 
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CATTTGATAT GGAGGACGTC TAAGATGAAT AACTCAATTA AACTCAAACG TAGACTGACT 1560 

CAAAGCCTTA CTTACCTTTA CCTGATTGGT CTATCAATTG TAATTATCTA TCCACTGTTG 162 0 

ATTACCATTA TGTCAGCCTT TAAAGCAGGT AACGTCTCAG CCTTTAAACT AGATACTAAT 1680 

ATCGACCTCA ATTTTGATAA CTTTAAAGGC CTCTTCACTG AAACCTTGTA CGGTACTTGG 174 0 

TACCTCAACA CTTTGATTAT CGCCTTAATT ACCATGGCTG TTCAAACAAG TATCATCGTA 1800 

CTTGCTGGTT ATGCTTACAG CCGTTACAAC TTCTTGGCTC GTAAACAAAG TTTGGTCTTC 1860 

TTCTTGATCA TCCAAATGGT GCCAACTATG GCCGCTTTGA CAGCCTTCTT CGTTATGGCG 192 0 

CTTATGTTGA ACGCCCTTAA CCACAACTGG TTCCTCATCT TCCTCTACGT TGGTGGTGGT 1980 

ATCCCGATGA ATGCTTGGCT CATGAAAGGC TACTTCGATA CAGTGCCAAT GTCTTTAGAC 2 04 0 

GAATCTGCAA AACTAGACGG TGCAGGACAC TTCCGCCGCT TCTGGCAAAT TGTTCTACCA 2100 

CTTGTTCGCC CAATGGTTGC CGTACAAGCT CTCTGGGCCT TCATGGGACC TTTCGGGGAC 216 0 

TACATCCTCT CTAGTTTCTT GCTTCGTGAG AAAGAATACT TTACTGTTGC CGTAGGTCTC 222 0 

CAAACCTTCG TTAACAATGC GAAAAACTTG AAGATTGCCT ACTTCTCAGC AGGTGCTATC 2 2 80 

CTCATCGCCC TTCCAATCTG TATTCTCTTC TTCTTCCTAC AAAAGAACTT TGTTTCAGGA 2 340 

CTTACAAGTG GTGGCGACAA GGGATAATTT ATCCCCGCCA CCCTTTTTCA T TT T AT ACT C 24 00 

TTCGAAAATC TCTTCAAACC ACGTCAGCTT TATCTCCAAC CTCAAAGTTG TGCTTTGAGC 24 60 

AACCTGTGGC TAGTTTGCAC TTTGATTTTC ATTGATTATT AGCAATTGTC ACTGTAAATA 2 52 0 

AT AT C CTTGT AGCAAGCAAT TTTTCTCCTA GACTTGAAAT AAAGCGCATT TCTCTATATA 2 580 

ATAATACTCA TATAGAAAAC ACCTTTTAGA AAGATACCTA TGCTTCCATA TCCATTTTCC 2 64 0 

TATTTTTCAA GTATTTGGGG GGTTCGTAAG CCCCTGTCCA AACGTTTCGA GCTCAACTGG 2 7 00 

TTTCAACTTC TCTTTACCAG TATCTTCCTT ATCAGCTTGT CTATGGTACC CATTGCTATC 27 6 0 

CAAAACAGCT CCCAGGAGAC CTATCCGCTA GAAACTTTTA TCGATAATGT CTATGAACCT 2 82 0 

CTGACAGATA AGGTTGTCCA GGATCTCTCT GAACATGCTA CAATTGTCGA TGGCACATTA 2 880 

ACTTATACTG GAACAGCTAG TCAAGCCCCT TCTGTTGTGA TTGGTCCAAG TCAAATCAAG 2 94 0 

GAATTACCTA AGGACTTGCA ACTGCATTTC G AT AC AAAT G AGCTAGTCAT CAGCAAGGAA 3 00 0 

AGCAAGGAAC TGACCCGCAT CTCTTACCGA GCCATTCAGA CTGAGAGTTT CAAAAGCAAA 3 060 

G AC AG CTTG A CCCAAGCAAT TTCTAAAGAC TGGTACCAAC AAAATCGTGT CTATATCAGC 312 0 

CTCTTCCTAG TTCTCGGTGC GAGCTTCCTC TTTGGTTTGA ATTTCTTTAT CGTCTCTCTT 3180 

GGAGCTAGCT TTCTCCTTTA TATCACCAAA AGATCACGCC TCTTTTCATT TAATACCTTT 3240 



WO 98/18931 



PCT/US97/19588 



342 

AAAGAGTGCT AC C ATTTT AT CTTGAACTGT TTAGGATTGC CGACTCTGAT TACACTTATT 3 3 00 

TTGGGATTAT TTGGCCAAAA TATGACAACC CTGATTACTG TACAAAATAT TCTTTTTGTT 33 6 0 

CTGTATCTGG TCACTATCTT T T AT AAAAC A CATTTCCGTG ATCCAAATTA CCATAAATAG 3 42 0 

GAGATTTTTA TGCCCGTTAC GATTAAAGAC GTGGCCAAGG CTGCTGGTGT TTCGCCTTCA 348 0 

ACCGTAACCC GTGTTATTCA AAATAAATCA AC CAT TAG C G ACGAAACAAA AAAACGTGTT 3 540 

CGCAAAGCTA TGAAGGAACT CAACTACCAC CCAAACCTCA ACGCTCGTAG CTTGGTAAGC 3 600 

AGCT AT ACT C AGGTTATCGG ATTAGTTCTT CCTGATGACT CAGACGCCTT CTACCAGAAT 3 660 

CCTTTCTTTC CATCGGTTCT ACGTGGCATC TCTCAAGTCG CATCTGAAAA CCACTATGCC 3 72 0 

ATTCAGATAG CAACAGGGAA AGATGAGAAG GAGCGTCTCA ACGCTATTTC ACAAATGGTC 3 780 

TACGGCAAGC GTGTAGATGG GCTAATTTTT CTCTATGCCC AAGAAGAAGA CCCTCTCGTA 3 840 

AAACTCGTCG CAGAAGAACA GTTCCCCTTC CTTATCTTAG GTAAATCTCT ATCTCCTTTC 3 900 

ATCCCACTTG TCGACAACGA CAATGTTCAA GCTGGTTTTG ATGCGACTGA ATATTTCATC 3 9 60 

AAAAAAGGCT GCAAACGCAT TGCCTTTATC GGAGGAAGTA AAAAGCTCTT CGTGACCAAA 4 02 0 

GACCGTTTAA CAGGCTATGA ACAGGCGCTT AAACATTACA AACTTACCAC TGACAACAAT 4 0 80 

CGCATCTACT TTGCCGACGA GTTTCTGGAA GAAAAGGGCT ATAAATTTAG CAAGCGATTA 414 0 

TTCAAGCACG ATCCACAAAT TGATGCTATC ATCACAACCG ATAGCCTCCT AGCTGAAGGT 4200 

GTTTGTAACT ATATTGCCAA ACACCAGCTG GATGTCCCTG TTCTCAGCTT TGACTCGGTT 42 6 0 

AATCCCAAGC TCAACTTGGC AGCCTATGTC GATATCAATA GTTTAGAGCT TGGTCGTGTT 43 2 0 

TCCCTTGAAA CTATTCTCCA GATTATTAAT GATAATAAAA ACAATAAACA AATTTGTTAC 4 3 80 

CGTCAATTGA TCGCCCACAA AATTATCGAA AAATAAGAGA CTGGGCAAAA AGTCGTTAAA 444 0 

AG C AAAAACG CAT AC T ATC A GGTATTGAAA AAACT T GAT A CTATGCGTTT TATTGTGGGA 4500 

AGATTTACTT CCTTTTCTAC TGAAAT TG AG TCTTTTCCCA AGATCTTTTT AT ACT C AATG 4 560 

AAAATCAAAG TGCAAACTAG GAAGCTAGCC GCAGGTTGCT CAAAACACTG TTTTGAGGTT 4 62 0 

GTAGATGAAA CTGACGAAGT CAGTAACCAT ACCTACGGCA AGGTGAAGCT GACGTGGTTT 4680 

GAAGAGATTT TCGAAGAGTA TTAATCACTA ATT AT C TAT C TCAACAAATC TTCCTAGAAT 474 0 

ATGAAC AT TT TCCGAGACAG AGACAAAGGA GCTTGGATCC ACTTGTGTCA TAATCTGTTT 4800 

AAATTCATTA AACTCTGCAC GTGTAATGAC AGTGATTAAA ACTGCCTTTC TCTCGTGATT 4860 

ATAGGTTCCT TCTGCATCGT GGATCATGGT TGCTCCGCGG TGCAATTTTT TATGGATTTT 492 0 

TTCAATTACC TTCTCTGGAT GATTTGTCAC AATCATGGCC TGCATACGCT TTTGCTTAGT 4980 

AAAGACTGCG TCTGT C AC AC GGCTAGAGAC AAAGATGGTA AT C AT AGAAT AAAGAGCGTA 5040 
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TTTCCAACCA AAGGTCAAAC CTGCTATCAG CATGATAGTT CCATTTACCA AGAAAGAAAT 5100 

ACTACCGACA TTCTTACCCG TTTTCTTACG AATAGTCAGG CTGACGATAT CCGTCCCACC 5160 

ACTGGAGATA TTGTTTCGAA GAGCAAAACC AATCCCCAAA CCCATAACAA CACCCCCAAA 52 2 0 

AAGGGAATTG AT AATGG GAT CCTCTGTCAA GGTTGCCACA GGGACAAACT GGATAAAGAA 52 80 

GGAACTCATA G AT AC CGTG A TAAAGGTAAA GACGGTGAAC TTATGGCCAA TCTGATACCA 5 3 40 

AGCTAAGACC ATCAAAGGGA AGTT AATGG C GTAGAAGCTT AGCGAAATCG GAATATGAAA 54 00 

AC C AAAC C AG TGATTACTCA AGGCAGAGAT AATCTGTGCC AGACCTGTTG CACCACTCGA 54 6 0 

ATACACATGC CCTGGTTGGA AAAAGAAATT AACTGCTACT GCTGATAAAA AACCATAGAC 552 0 

CAGAGAGGCC GAAATCTTCT CATCATACTT TTCTCGAGAG ATACTTTGTA AGACACGTAA 55 80 

AATTTTTATC TGATAAGCAA AGCGGCGCAG ATAATAGCGC CACCGCTTAA TTCGTTTTGT 5 64 0 

TTGTTTCATC TTCTTCTACT TGTAAGCTGA GTTCCTCTAG TTGTTTGAGA GCGACTGTTG 57 00 

ATGGAGCTTG TGTCATTGGG TCAGTTGCCT TGTTGTTCTT AG G AAAGG C A ATGACTTCAC 57 6 0 

GGATATTTTC TTCTCCAGCA AGCAACATGA CAAAACGGTC AAGCCCGATA GCCAAACCAC 5 82 0 

CGTGTGGTGG GAAAC CAT AG TCCATGGCTT CAAGAAGGAA ACCAAACTGG TCATTGGCTT 5880 

CTTCAGTTGA GAAAC C AAG A GCCTTGAACA TGCGTTCTTG AAGGTCTTTT TGGTTGATAC 594 0 

GAAGGCTACC ACCACCAAGC TCATAACCGT TCAAGACGAT ATCGTAAGCA ATGGCACGAA 6000 

CCTTAGCCAA ATCACCTTCT AATTCATGAG CAGTCTCTTC CTGTGGAAGT GTGAAAGGAT 6 0 60 

GGTGGGCGCT CATGTAGCGG CCTTCTTCTT C AG ACC ATT C AAACATCGGC CAGTCAACCA 612 0 

CCCAAAGGAA GTTGAACTTA TCATTATCAA TCAAGCCAAG CTCTTTAGCA ATACGTCCAC 6180 

GAAGGGCACC CAGTGTTGCA TTAGCCACTT CAAGCGTATC CGCCACAAAG AGAACCAAGT 62 4 0 

CCTTATCTTC AAGAACAAGC GCTGTTGTCA ATTCTTCTTG GATACCAGTC AAGAACTTGG 63 00 

CAACTGGTCC GTTTAATTCT CCATCAACCA CCTTGACCCA AGCAAGACCT TTGGCACCAT 63 60 

ACTGTTTGGC TACTTCCGTC ATCTTGTCGA TGTCTTTACG TGAATAGTTG TCCGCAGCTC 64 2 0 

CTGTGACCAC AATCGCTTTT ACAGCAGGTG CTTCTGAAAA GACTTTAAAG TCTACACCTC 64 8 0 

GGACCACTTC TGTCAAGTCC TGAAGCAACA TGTCAAAACG AGTATCTGGC TTGTCAGAAC 6 54 0 

CGTAAAGAGC C ATAGC AT C A TCGTATTTCA TACGAGGGAA TGGTAGCGTT ACTTCGATGC 6 600 

CTTTTGTTTC CTTCATCACG CGCGCGATCA AGCTTTCTGT AATATCTTGG ATTTCTTGCT 6660 

CAGTAAGGAA GGACGTTTCC AAGTCGACCT GAGTAAATTC AGGCTGGCGG TCTCCACGCA 6720 

AGTCCTCGTC ACGGAAACAT TTAACGATTT GGTAGTAACG GTCAAAACCA GCATTCATCA 67 80 
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AGAGCTGTTT 


CGTGATTTGT 


GGACTTTGAG 


GAAGAGCGTA 


AAAATGCCCC 


TTATTAACAC 


6840 


GAGACGGCAC 


TAAATAATCA 


CGCGCCCCTT 


CAGGCGTTGA 


CTTAGAAAGG 


AATGGTGTCT 


6900 


CCACGTCGAT 


AAACTCCAAC 


T CAT C C AAGT 


AGTTGCGGAT 


AGAGTGGGTC 


ACCTTGGCAC 


6960 


GAAGTTTAAG 


ATTTTCCAAC 


ATTTCTGGAC 


GACGAAGGTC 


AAGGTAACGG 


TAACGCAAAC 


7020 


GTGTATCGTC 


ATTTGCCTCA 


ATGCCATCCT 


TAATCTCAAA 


TGGTGTTGTC 


TTAGCTGTGT 


7080 


TAAGCACAAT 


AAGAGCTGTC 


ACGTTTAACT 


CAACCGCACC 


AGTTGGCAAC 


TTATCATTGG 


7140 


CTTGTCACGC 


GCAGCGACCT 


GACCAGTCAC 


CTCAATAACA 


AATTCGCTAC 


GAAGGcTTTC 


7200 


AGCTGTTGCC 


ATAACCTCTG 


CAGATACTTT 


TTCAGGGTTG 


ATAACCAACT 


GCATGATTCC 


7260 


TTCACGGTCA 


CGAAGATCGA 


TAAAGATCAA 


ACCACCAAGG 


TCACGACGAC 


GGCCAACCCA 


7320 


TCCTTTCAAG 


GTTATTTCTT 


GTCCGATGTG 


TTCCTCACGA 


ACACGACCAG 


CATACATACT 


7380 


ACGTTTCATT 


ATTTCTCTCC 


TCTTTTATTC 


TGTTACTATT 


TT AC CAT AAA 


AGCGCAGCTC 


7440 


TTCATGAAAA 


T CAT C AG AAA 


AGTTTGCCAG 


TCTTTAAAAG 


TCAGGTGAAA 


GCCCTAAAAA 


7500 


TTAGCGCTAA 


TACTCTTCGA 


AAATCTCTTC 


AAACCACGTC 


AGCGTCGCCT 


TACCGTATGT 


7560 


ATGGTTACTG 


ACTTCGTCAG 


TTTCATCTAC 


AACCTCAAAA 


CCATGTTTTG 


AGCTGACTTC 


7620 


GTCAGTTCTA 


TCCACAACCT 


CAAAACAGTG 


TTTTGAGCAA 


CCTGCGGCTA 


GCTTCCTAGT 


7680 


TTGCTCTTTG 


ATTTTCATTG 


AGTATAATAC 


AAAAATCCGA 


TGAACTTCAC 


CGGACTCTTT 


7740 


TATTTTGAAT 


TTTTGCCTGC 


TTTACGCTTT 


TCAGCGATTT 


CGGCTGCCTT 


TCGAGGCAAG 


7800 


ACAATTTCCG 


TTATGTAAGC 


CGTCCCAAAA 


CGCAGTACAC 


CTGCAATAGG 


AGCAAAGACA 


7860 


ACT GC TAGAT 


AGTTATAGAA 


GAAATCGCCT 


TTGAAGGCAT 


AAGCTAGCGC 


TCCAATGATG 


7920 


AAAAATAGAA 


CGACTGCCTG 


AATCACTGCT 


AATAAAATTA 


CTCGTTTCAT 


GTGACCTCCT 


7980 


GACTCTATTA 


TAGCATGAGA 


AT CAT C AAAA 


AGCCGACTAA 


ATTATTCAAA 


GCGTGAAGAG 


8040 


AAATACTGTA 


GACCAGACCT 


TTTCTGCTAA 


TGTAAGCCAA 


ACCCAAACTA 


AAACCAAGGC 


8100 


TAAAATAGAC 


AAAAAATTGT 


TGCACATCAC 


CTGGAAAATG 


AATCAAGGCA 


AATAGAAGAC 


8160 


TAGAT AC C AG 


AAGAAAAATC 


AGGGTTCGTT 


TACTATTGTC 


CTGCTTAGGA 


AAG AG AT AG C 


3220 


GTGCTAACAT 


CCCTCTAAAA 


ACAATCTCTT 


CCGTCAAAGG 


AGCAAAAATA 


ACCACAGCAA 


8280 


AGAATGAGAA 


AAGTGGTTGA 


GACAAGGTCA 


AGTCTGTCGC 


TATTTGCTGA 


TTTACTGAAG 


8340 


GATCATCTGG 


CAAGAAGAAT 


TGAACGACCA 


GAGATAAGAA 


CCAAACCAAG 


ACAGGAAGCC 


8400 


AAATAAATCG 


ATTAAAGCCG 


CTCTTCTCAA 


TATGAACAGG 


AGCCTTCTGA 


TACCATTTGT 


8460 


AAATGCCGTA 


CACATATACT 


CCAGCCAAGG 


C C AC AT AG AG 


TAGAGTAACA 


GCATAGGGTG 


8520 


AAGCGCCTAA 


AGCAAGCGAC 


GCAGTCGCGA 


GCCCCTGAAT 


AAAG C CAT AG 


ATAAATAAAA 


8580 
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AGGATAGAAG GGCTAGAAGA ATCCAGCCAA GGTTTTTAAG TAATTTCATA GATAACTCCT 8640 

TTATTTGAAA TAACGTTTTA CCATAGGTAA CTGCATCACA T T GAT AT AAA CATGGATGGC 870 0 

TCCTACAAGC AAGAAAGCTA GTAACTGAAT CTCTCCTGTC AAGAAAGAAA TGATAATAAG 87 60 

AAAAATATAT AAGGCTGGTA AG AC AT AT TG GTGTAATTGG AATAAAATTC GAAAACTCTG 882 0 

TTCCAAATTA GCCTGACGCT CCCCTTCATC ATAAGAATTT ATATAGTTCA AGACATCCTT 8880 

TGGTGTAGCG AAAAATTCCA AATCAAACTG ACGAACAATC GCAATGGTTT TAAAAAGAGA 8940 

TTTTTGAGCG ACTAAGAATA C C AC AAAG AG TAAGAAAGAA AGGAAAAATG TTTGAGGGTT 9000 

TGTATGCAAT AT AATC AC C T CACTTAATGA AATAAAAATA GCCAATGGAA TCGCTACACC 906 0 

TGTAATATTA AAAGCAATGG TTCCAAACTC AAGATTCCGA T AC AT T TGC A CATAATAGGT 9120 

TTCATTCAGA TCGTCATCCA TTTCCTCTTG ATACAAAGAA TGAAATTTTC TGCTTTTCTT 9180 

TAAGAAATTG AAAGTCAAAA ACATACTAAT GAAACCTATC AGTAAACAAA TAGCTGATAT 9240 

CCATGGCATC AAGGCTTTTA CATCTAAAAT AATTTCGTGG GATTCGACAC GTGCCTTAAA 93 0 0 

CATCCCTACA AACATGCCCA AGAACCCCCC AAGACAATAG AC AT C AAAAA TAACAATCTA 9 3 60 

CGTTTCTTTT T C AT ATTC AT TCTCCTTTTT CACTTGCTAG ATTTTTGGAT TTCTTTTCAA 942 0 

TCCATTCAAT TACTGGGATG AGAGCAAAGT AGACCCAAAC AAATTGGTCG CTTTGATAGG 94 8 0 

GATTAAACCA GCTTAGGTCC ATCCCAATCA GTAGAAATAC GCTGACTAAT AAAGCTATGA 9 540 

CCACTACATA ATAAATCACT TTATACTTGT T CATC ACT CG TCCTCCTCCA AACGAAATAC 9 600 

CGATTCGACT GTTTCGTTGA AAATTTGAGA TATTTTCAGG GCAATGATAA TGGATGGGGT 9660 

GTACTCATCC CGTTCTAGTA GGCTAATGGT CTGTCTGGAA ACCCCTGCCA GTTTGGCTAG 972 0 

GTCGGTTTGA TTGAGACCAT CGCGAGCTCG AAGCTCTTTT AGACGATTTT TTAGTTGCAT 97 80 

GTTACACACC TACTCTCCGT CAAATTCAAC GGTTTGGATA TCCTCAATAC GTTGCAACTT 9840 

GAATTTTTCT TTTCCCGTAT TATCTACACG TCGTAGCTTT ACCCATTCCT CAT C AAC ATC 9 900 

CACAACTTCC CAGTTATCTG GCCCAATATA CACTCCCGTT ATAATTGGTT CCTTTCCAAT 9960 

CATTTCTTGT AATAATCTCG ACATTTCTGC GTTTCCTTTC TCTTTTCGCT CAAGTCTTTT 1002 0 

GATTTTATTC TCTAGTTTCT TGATTTTTTT AGAATTATTA GAATAAAAGA AAAT CAT AAA 1008 0 

TAGTATAAAT CCTAGTACCC ACATTATAAC TCCTTTCTGC TTCCTATTTC TTAACTTGAA 10140 

TTCATTGTAA CATATCTTTT TCTTTTTGAC AAGTATAGTT GTCAAAAAAA TTATGATTTT 10200 

TGTCATTTTG CAAAAGAAAA AGGTCAGGAG TAGGTTCCTG ACCACTTTAT CTATCATTAA 102 60 

TACTCTTCTA AAATCTCTTC AAACCACGTC AGCTTCACCT TGCCGTAGGT ATGGTTACTG 10320 
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ACTTCGTCAG TTTCATCTAC AACCTCAAAA CCATGTTTTG AGCTGACTTC GTCAGTTCTA 10380 

TCCACAACCT CAAAACCATG TTTTGAGCTG ACTTCGTCAG TTCTATCCAC AACCTCAAAA 10440 

CCATGTTTTG AGCTGACTTC GTCAGTTCTA TCCACAACCT CAAAACAGTG TTTTGAGCAA 10500 

CCTGCGGCTA GCTTCCTAGT TTGCTCTTTG ATTTTTATTG AGTATAAAAT CCTAGTTTTT 10560 

CAAAGATTTC TGAGAAGTTT TGGCTGATTG TCTCAAGTGA CACTTGCACT TCTTCTCGGG 10620 

TTTGGTTGTT CTTGACCGTC ACTTGTCCGC TTTCGACTTC GCTCTCTCCT AGGGTGATGA 10680 

GGGTCTTAGC CGCAAAGACA TCGGCTGACT TGAACTGAGC TTTTAGTTTA CGGTTGAGGT 10740 

AATCACGCTC TGCTTTGAAA CCTTGTTGGC GAAGAGCCTG TACCAATTCC AAGGCCTTGA 10800 

TATTTGCCCC TTCGCCCAAG ACTG C GAT AT AGACATCTAG GGCGTTTTCG ATAGGGAGGG 108 60 

TCACACCTTG CTTTTCAAGG ATGAGAAGCA GGCGCTCTAC ACCAAGTCCA AAACCAAATC 1092 0 

CAGCAGTTTC AGGGCCTCCA AAGTAAGCAA CCAAACCATC GTAGCGACCA CCCGCACAGA 10 9 80 

CGGTCAGGTC ATTGCCCTCA ATCTCTGTGA TAAACTCGAA AATGGTGTGG TTGTAGTAGT 11040 

CCAGACCACG CACCATATTG GTATCGATGA TGTAATCTAC TCCAAGATTT TCCAACATCT 11100 

G AC GC AC AG C AT C AAAATGA GCTTGGCTTT CTTCATCAAG AAAGTCCAAG ATAGACGGCG 11160 

CATTCTCTAC TGCCACCTTG TCTTCTTTTT CCTTAGAGTC CAAGACACGA AGAGGATTTT 1122 0 

CCTCCAAGCG ACGTTGGCTA TCCTTAGACA AGGTCTCCTT GAGCGGTGTC AAATAGTCAA 1128 0 

TCAAGGCTTG GCGGTAGGCT GCACGGCTCT CAGGATTTCC AAGAGTGTTG AGGTGCAATT 11340 

TGACACCTTG AATACCGATT TCCTTCAAAA AATGGGCTGC CATAGCGATT GTTTCCACAT 11400 

CGGTAGCTGG ATTGCTAGAG CCAAAACACT CAACACCAAT CTGGTGGAAT TGGCGCAAGC 114 60 

GCCCTGCCTG TGGACGCTCA TAACGGAACA TAGGTCCCAT GTAGTAGAAC TTGCTTGGCT 1152 0 

TTTGCACTTC TGGGGCGAAA AGTTTATTTT CCACATAGGA ACGGACAACG GGTGCAGTTC 115 80 

CTTCTGGACG GAGGGTAATA TGACGGTCAC CCTTGTCATA AAAATCGTAC ATTTCCTTGG 11640 

T T AC GAT AT C CGTTGTATCT CCGACAGAGC GACTGATAAC CTCGTAATGC T C AAAAAT AG 11700 

GCGTGCGCAC TTCTGCATAG TTGTAGCGTT TGAAAATCTC ACGGGCAAAG CCCTCAACGT 117 60 

ACTGCCACTT AGCAGACTCA GCAGGTAAAA TATCCTGCGT TCCTTTTGGT TTTTGTAATT 11820 

TCATAGGGAA TCCTCTTTAA ACTTAATAGT CTTATTTTAC CATAAATAGA GGGATTAAAA 118 80 

CAGTAAGAAA AAAATTAGGA TTTAGATATC ATTTTTGAGA TTAAGAATTG TCAAAAAAAT 11940 

AGCTAGCAAG GAAAGACCAA CAAATAGCAT CCAAGTCAAC TGTATATTCC ATACGGCTAC 12 000 

TAGTGAAAAA CAAGCTGTTC CCACAGGTAT GGATAAGGTA AACAATAGAC CTAAAAAATT 12060 

ACTAGTACGA GCTAGAACCT CTGGAGCTAG ATTTTTCATG AGCATGGCAC TAATCTTTGG 1212 0 
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TTGAACTTTA CC AG AC AC AT ACAGAGTAAA GAAGAGAAAT AGCAAACCAA GCACGACTTG 12180 

AT TG AAT AAA TTAGCCAAAC CAACTAGACT AAGTCCTACG GTCTCCCACA TCATCAATCT 12 2 40 

AGGCAAGGAC TGCTTCCCAA AAT AAT CAT T GCCCGTAAGG CTACTGATGA T G AC T GAT AC 12 3 00 

TAAAACACAG AATTGATTGA TAAATAGTGC CTCTGTATAA GAAAAATTCA AGAGAGAATG 123 60 

GCTCAAAAAG AAG AT AT T AT AAATTCCACC CAAAGCGCCA CCCAAGGAAT TAATAAGCAA 1242 0 

G AC AG C AAAG AGCATAAAAC CAAAGTTTTT CTGTCCACTT TTAAGAAAAA CGAGACGTAA 12480 

ATTTCGGTAA ATTGTTAGGA ACTGGTCTTT GATAGAAAGC TTCTCATTTT TTAAGTTTTC 12 540 

ACCATCAGCA GATGACATTG ACAGGCTCAA TTTGCTTTTT CCTAAAAAGA GGATAGTGGC 12 600 

TGATACTAGG AAAAAGCAGG CATTGATTCC CGCAACGAGA GAAAAATTGT TGACCGATAG 12 660 

AGCTAAGAGC CAGACTCCGA AAGCTTGACC ACCAATAGCT GAAATATAGG TGATGAACTG 12 72 0 

TGAAAAAGAA TAAGCCTCCA T C AG AT CATC TTCAGCTACT TTTTCCTTAA T AAGAGG C AT 12 7 80 

ACGCAGGCCA CCTGCAAAAT CACTGATGAT ATCACTAATG ACATTGATCA AACACAGGCT 12 84 0 

AGAAAAGGCA AAGAGACTAG CTTGCTGAAC AACTAGGGCT GCTAGAAAAA AT AG AAC CGC 12 9 00 

CT G AAAC AAA CCGCTATAGA CCATCCATTT GACCTTGTCC CTCGTGTAAT CTGCCCGAAT 12 9 60 

CCCTGCAAAA AC T GT AAAG A GGGTCGGAAG AATCATGACA ATATTCGCCA TAG C AAC AG C 13 02 0 

AAAAGATGCT TGTGACAAGG TCGATGCATA GACGATAAAG ACCAGGTTGA AAATCGAAAC 13080 

AC C AAAAG C A TTGAAGAAGC GTGG 13104 
(2) INFORMATION FOR SEQ ID NO: 35: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19250 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

CCGGGCAAAT AGTTTTGAAC TTTTCATCAT TTTCTCCTTT AAAACTTTCT CTCCATTATA 60 

GACTCTTTTC AGAAAGTTGT CAACAGAATT TT C AG AAT T T TTGAAAATTA TTTTTCAAAC 12 0 

AACATCTTTG CAAAAAATAT GAATATCGTA AGCGCGTCAT AACAAGGTAT CTATCATTCA 180 

TGGAGCTCCT CCTGTATACT ATTAGTAAAG TAAATATTGG AGGATATTTT AATGCCACAA 240 

CCTATTGTTC CTGTAGAGAT TCCACAATCT CGTCGTTTTG ATTCTAAAAA GAGAAATGAT 3 00 

ATTCTrCTTA AAATTCGTAT TGGCAAGCTT GAAGTAAGTT TTTTTCAATC TCTCAATCTC 3 60 
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GAAATGATAG AACAGCTTTT GGATAAGGTG TTGCTCTATG ACAATTCATC TATCTAGCCT 42 0 

AGGGCAGGTC TATCTCGTGT GTGGGAAAAC TG AT AT GAGA CAAGGAATCG ATTCACTGGC 4 80 

TTATCTCGTT AAAACCCACT TTGAATTGGA TCCTTTCTCC GGTCAAATCT TTCTCTTTTG 54 0 

TGGTGGACGT AAAGACCGCT TTAAAGTCCT TTACTGGGAT GGTCAAGGAT TTTGGCTACT 6 00 

AT AT AAAC G C TTTGAGAACG GCAGACTGAC TTGGCCCAGT ACAGAAAAGG ATGTCAAAGC 660 

TCTCGCACCT GAACAAGTAG ATTGGC T GAT GAAAGGCTTT TCTATCACTC CAAAAATATA 72 0 

GTAGATTGAA ACTAGAATAG TACACCTCTG CTTCTAAAAC ATTGTTAGAA ATCGATTTTA 7 80 

CTGTCCTGAT CGATTTGTCC TGTTATTATT TCATTTTACT ATAAATCCAT CAGAAAGTCG 84 0 

TGATTTCTAT TGAAATGAGG ACTTTCTTTT TATACTCATC TGCTTTCAAA AAGCACTCTA 900 

GTCCATCTCC GATTAACGAT GGACTTTATC ACCTCCTTCT CCAGTCCTTG TATAACATCT 960 

TGAAGTTGAT TCATGACATC TTCCAAAGTT CGAAAGGCTT TATTCTTAAA TCCACGTTTA 1020 

CGAATCTCTT TCCACACTTG TTCAATGGGG TTCATCTCTG GTGTGTATGG AGGAATAAAT 1080 

GCAAAGCCAA TATTAGTCGG AATCTTTAAG GT AC T T G ATT TATGCCATAT AGCATTGTCC 114 0 

ATAACGAGTA AAAGATAATC ATCTGGATAA GCTTGTGAAA GCTCCTATTC CTAAAGCCCC 12 00 

TTTATAACCT CTTGCGAGAG AGACTATTGA CTCAGCCCTT ACTTCATGCG GATGAAACCT 12 6 0 

CCTATCGGGT TCTAGAGAGT GATAGCCATC TGACCTACTA TTGGACTTTT TTGTCAGGTA 132 0 

AAGCAGAGAA ACAAGGGATT ACGCTTTACC ACCATGATCA GTGTCGAAGT GGTTCAGTAG 13 80 

TACAAGAATT CCTAGGAGAT TATTCTGGCT ATGTTCATTG TGATATGTTG CGGCAGTAAC 144 0 

TTAGGACTTT AGTCCTCTAG TTCTGCCTAT GCGATAGCAG TCCAAGGTTT AGGAGTAAGG 15 00 

CGACGCTAAG CTTGGTAAAC TGCGAACAGC TAGAAGCTTA TCGTCAACTG GAAGAAGCTG 15 60 

CACTTGTTGG ATGTTGGGCG CATGTGAGAA GGAAGTTTTT TGAAGTGCCC CCCAAGCAAG 162 0 

CAGATAAATC ATCCTTAGGA GCTAAAGGTT TAGCCTATTG TGATCAGTTA TTTTCCTTGG 1680 

AAAG AG ACT G GGAGGCTTTG CCAGCTGATG AACGGCTACA GAAACGTCAA GAACATCTCC 1740 

AACCCCTACT GGAAGACTTC TTTGCTTGGT GCCGTCGTCA GTCAGTTTTA TCGGGTTCAA 1800 

AACTAGGAAG GGCAATTGAA TACAGCCTCA AGTATGAAGA AACCTTTAAG ACCATTTTAA 1860 

AAGACGGACA TCTGGTCCTT TCCAATAATC TAGCTGAACG CGC C ATT AAA TCATTGGTTA 1920 

TGGGACGGAG TAAAAGAGTC CAGTGGACTC TTTTAGCCTA AGCTCAGTTT AAAAAAACGA 1980 

GGGTGGTTAT TTTTAAAAAA GCGAGGGTGG TTATTTTCTC AAAGTTTTGA AGGAGCTAAA 2 040 

GCAAGAGCTA TTATTATGAG TTTGTTGGAA ACAGCTAAAC GTCATCAATT ATAGTGCGTT 2100 

GAATCTATAA CAGTACGCAT CGACTGCTAA AATATTTCTA TAAATCAATT TTCCTTTCCT 2160 
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AATCGATTTG TT C AT AT C T T ATTACAATCC ATTATAAATA GCGAGAAATA TCTATCCTAT 222 0 

CTTCTAGAAT GTCTTCCAAA CGAGGAAACT CTCGTAAACA AAGAGGTTTT AGAGGCCTAT 2280 

TTACCGTGGA CTAAAGTTGT ACAAGAAAAG TGCAAATAAG AAATCTCCAG ATTAGGAACT 23 40 

ATATATGAGT TCTCTAGTCT GGAGATTTTT CAATAGACTT CGTTATTGGG CGGTTACTTT 2400 

CGAAACTTTG AAAACTTCAA AAAACGGATT TTTATCGCTC TGAACATCAA AAAAGAAAGG 2460 

ACGAAATTTG TCCTTTCTCA AGCTTAGCTT TTCTTCAACC CACTACAGTT GACAAAGAGC 2 52 0 

CCTTTATTCT ATCAAACATG AAGCGCAAAA AC AAGC C AAA AATCCGATAG AATGGCTATC 2580 

CCTCGACTAT CAAGTAAGAC ATTTCCATCA AATACGTTCA ATTTTACTCT TGTTCTACTA 2640 

AGAATTAATC ATCTCGTTTT GATTTATTAA AAATATACAA TTCAGCTTTT CCTCCAAACT 2 700 

ATTTTATCCA CTATCCCTGT ATAGCTCTGT ATTATCTTAA CAACTTTAGT AGAGACATTT 27 60 

TCCTCAACAT AATCCGGAAC CGGTAATCCA AAATCCTCAT CTTGTGCCAA GCTAACAGCA 2 82 0 

GTTTCAACTG CTTGAAGAAG AGAATTTTCA TCAATGCCTG CCAAAATAAA TCCTGCCTTA 2880 

TCTAAGGACT CAGGACGTTC TGTACTTGTA CGAATACATA CAGCGGGAAA AGGATAACCT 2940 

TGACTAGTAA AGAAACTACT TTCTTCCGGT AAAGTTCCCG AATCAGATAC TACAACAAAT 3 000 

GCATTCATCT GTAAACAATT ATAGTCATGG AATCCTAGTG GCTCATGCTG AATCACACGT 3060 

TTATCTAGTT TAAAACCGCT CTCTTGTAGC CTTTTCTTTG ATCTAGGATG GCAAGAATAT 312 0 

AAGATTGGCA TATTATACTT TTCAGCTAAT TGATTAATTG CTGTAAAGAG AGAAATAAAA 3180 

TTTTTATCTG TATCAATATT TTCCTCACGG TGAGCTGAAA GTAAGATATA ACCTCCTTTT 3240 

TTCAATCCCA AACGTTCATG GATATCTGAA GACTCAATAG CAGATAAATT TTTATGTAAC 3 3 00 

ACTTCTGCCA TAGGAGAACC AGTTACATAT GTGCGCTCTT TAGGTAAACC ACACTCATGT 33 60 

AAAT ACT TAC GTGCATGTTC AG AGTATGC T AAGTTAACAT CTGAAATAAC ATCAACAATC 342 0 

CGACGATTAG TCTCTTCCGG TAGGCACTCA TCTTTACAGC GATTGCCAGC C T C CAT AT G A 3480 

AAAATTGGAA TATGTAAACG CTTGGCAGCA ATAGCTGATA AACAAGAATT TGTATCCCCT 354 0 

AAAATCAATA AAGCATCTGG T TT AATTTG A TTCATCAATT TGTATGAAGT ATTAATAATA 3 600 

TTCCC TAC AG TAGCACCAAG ATCATCTCCA ACAGCATCCA TGTATACGTC CGGAGTGTCT 3 660 

AACCCTAAAT TATCAAAGAA AAT AC C ATT T AAATTGTAAT CATAGTTTTG TCCAGTATGT 3720 

GCCAAAATAA CATCAAAATA CTTTCGACAT TTAGTGATAA CACTACTTAG ACGTATAATC 37 80 

TCTGGACGTG TTCCCACAAT AATCAATAAC TTAAGTTTGC CATTATCTTT AAAGTGAATA 3840 

T C ACT AT AAT CTGTCTTAAT TTTCATTTAT TTCTCCACTT GTTCAAAAAA AGTATCTGGA 3 900 
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TGTCTAGGAT 


CAAATGACTC 


ATTAGCCCAC 


ATGACAGTAA 


TTAGATTTTC 


TGTATCAGAA 


3960 


AGATTAATAA 


TATTATGTGC 


ATAGCCCGGT 


ATCATATGTA 


TTGCTTCAAT 


CTTATCGCCC 


4020 


GACACTTCAA 


AGTTCAGAAT 


AGG AT ACT C T 


TGACCGTTTT 


CATCCAGCCC 


TAT C CT AC GC 


4080 


TCTTGTATTA 


AAGCACGACC 


AGAAACAACC 


AT G AAAAAT T 


CCCACTTAGA 


ATGATGCCAA 


4140 


TGTTGCCCTT 


TGGTAATGCC 


AGGTTTAGAA 


ATATTAACAG 


AAAATTGACC 


CGTATTTTCT 


4200 


GTTTTTAATA 


ATTCCGTAAA 


ACTACCTCGT 


TCATCTATAT 


TCATTTTTAG 


AGGAAACTTA 


4260 


AACTTATCTA 


CTGGTAAATA 


AGATAGGTAG 


GTAGAATACA 


ATTTCTTTTT 


AAACGATCCC 


4320 


TGAGGAATTT 


CAGGCATAAC 


TAAACTATCA 


GGCTGTTTTT 


TAAATGTTTC 


TAATAGAGAG 


4380 


ACAATCTCTC 


CTAAGGTTGC 


ACGATGAGTC 


GTTGGTACGT 


AGCAGTAGTT 


TCCTGATGGG 


4440 


CTAGGTAAGA 


TTTGTAATCC 


ATCTAGATTA 


CAACGATGAG 


GATTTCCTTC 


CAATGCAGTT 


4500 


AGACACTCTT 


GTATCAAATC 


AT C AAT AT AC 


AGCAACTCCA 


ATTCTACACT 


TGGATCATTT 


4560 


ACTTGAATAG 


GTAAATCGTG 


AG CT AGATT A 


TAACAGAAAG 


TTGCTACAGC 


AG AAT TGT AG 


4620 


TTAGGACGGC 


ACCACTTCCC 


ATAAAGATTC 


GGGAAACGGT 


AAACTAAGAC 


AGGTGCTCCC 


4680 


GTTTTCTTTC 


CATATTCAAA 


GAAGAGTTCT 


TCCCCTGCTA 


G CT T AG AT T G 


TCCATATATA 


4740 


GAGTTTGAAA 


ATCGGCCTTC 


TAAACTAGCT 


TGAGTAGAAC 


TTGAGAGTAG 


AACAGGACAA 


4800 


GTGTTTTCAT 


ACTTTTCTAA 


AATCTCCAAT 


AATCTACTTG 


AAAAACCGTA 


ATTTCCCTCC 


4860 


ATGAATTCAT 


CAGGATTCTG 


TGGACGATTG 


ACACCAGCTA 


AATGGAATAC 


GAAATCGGCC 


4920 


TTCTTACAAT 


ATTCATCTAA 


TAAAATCGGA 


TCTGTATCAC 


GATCATACTG 


AAAAATCTCT 


4980 


CCAATCTCTA 


AATTAGGACG 


AGTCCTATCT 


CGTCCATCTT 


TCAAAGCTTC 


CAGAGTACAG 


5040 


ATAAGATTTT 


TTCCTACAAA 


TCCTTTCGCT 


CCTGTGATTA 


AAATATTTTT 


AATCATGCCC 


5100 


CCTCCTTATT 


TTATATGCTG 


TTTTAATAGT 


TAACTCTCTC 


GACAATACAT 


GATACATTAT 


5160 


ATATCCTTGA 


TAATTTTAAT 


GTATCTTAAA 


AGATTTTACA 


TCTCTTCGTC 


T GC T AC CAT A 


5220 


TCACGAATTG 


CTGTCTGTAT 


TTCATCTAAT 


TCTAGCAACT 


TTCTTTTAAC 


TTGCTCTACA 


5280 


TCCATCAAAT 


CGGTATTATT 


ACTATTGAAT 


TCTGTCAACA 


AATTTCTATT 


CGTACTACCA 


5340 


TCTTTGAAAT 


ACTTATCATA 


GTTAAGATTA 


CGATTATCAC 


TAGGAACTCT 


AT AAAAAT C A 


5400 


CCCAAATCAA 


TTGCATTTGC 


GCACTCTTCG 


TTAGTTAATA 


GTGTTTCATA 


CCTTTTTTCT 


5460 


CCGTGTCTAA 


TACCTATAAT 


CTTAATATCT 


TGTTCTGAGG 


CAAAAATTTC 


TGATAC AGCC 


5520 


TTAGCCAACA 


CTTCAATCGT 


ACATGCTGGT 


GCTTTCTGAA 


CTAGTATATC 


TCCAGATTTC 


5580 


CCTTCTTCAA 


ATGCAAATAA 


AACCAAGTCT 


ACTGCTTCTT 


CCAATGTCAT 


CACAAAACGT 


5640 


GTCATGCTAG 


GTTCAGTAAT 


TGTAAGAGCA 


TTTCCTTGCT 


TAATTTGCTC 


AATCCAAAGA 


5700 
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GGAACGACAG ATCCACGGCT ACACAGAACA TTCCCATAGC GAGTCACACA TATCTTTGTA 57 60 

TGCTCAGGAT TTACCGTCCT GGACTTAGCA ACAGCAATCT TTTCCATCAT AG C C TTGG AT 582 0 

GTTCCCATAG CATTGACAGG ATAAGCCGCC TTATCTGTAG AAAGACAGAT AACTTGCTTT 5880 

ACACCAGCTT CGATAGCCGC AGTGAGGACA TTCTCCGTTC CCAAAATGTT AGTTTTTACC 5940 

GCTTCTACAG GGAAAAATTC ACAAGAAGGT ACTTGTTTAA GAGCAGCAGC GTGAAAAACA 6000 

TAATCCACAC CATGCATAGC ATTTTTTACC GAAGCTAAGT CACGCACATC TCCAAGGTAA 6060 

AAACGGATTT TCCCAGCCAC TTCTGGTACT TTTACCTGAA ACTCATGACG CATATCATCT 612 0 

TGTTTCTTTT CATCTCGCGA AAATATACGA ATCTCTGAGA CATCTGTTTC TAAAAAACGC 6180 

TTGAGAACCG C ATT C CC AAA TGAACCTGTC CCTCCTGTAA TTAGGAGAGT TTTTCCTGTA 6240 

AATTGTGACA TATATTACAC TTCTCCTTCT AGTATGTCTG CAATTTTCTT ACAAGCCGTT 63 00 

CCATCTCCAT AT GG AT T TG A AGCTTGACTC ATTGCTTGAT AAACTGAATC ATTTTCTAAT 63 60 

AATTCTTTAA AATGCCTATA AATATTATTT TCATCAGCAC CTACAAGTTT CAAAGTCCCT 6 42 0 

GCTTCAATTC CCTCTGGACG TTCAGTTGTA TCTCTCATAA CCAAAACAGG TTTTCCTAAA 64 8 0 

CTTGGAGCCT CTTCCTGAAT ACCACCACTA TCTGTTAAAA TTAAATAACT TCTTGATAAA 6540 

AAATTGTGAA AATCTAATAC TTCTAAAGGT TCGATCATCT TGATACGTTC AC AG C C ACTT 6600 

AGTTCTTCCT CAGCAATTTG GCGAACACGA GGATTCATAT GGATAGGATA AATAGCCTTG 6 660 

ACATCTGAAT ATTCTTCAAT AATCCTTCTA ATTGCTCTAA ACATATGTCT CATCGGTTCA 672 0 

CCAAGATTTT CACGACGATG AGCTGTAATT AGAATAAACC TGCTTTCTCC TATCCATTCT 67 80 

AACTCAGGAT GCGTATAGTC CTCTTGAATT GTAGTTTGTA AAGCATCAAT CGCCGTATTA 6840 

CCTGTCACAA ATATGCTCTC TGGAGTTTTT CCTTCTCTTA AAAGATTATC TTTTGAAAGT 6900 

TGTGTTGGTG TAAAATGATA CTGAGCCAAA ACCCCAACTG CTTGACGATT AAACTCTTCA 6960 

GG AT AT GGTG AAT AG AT AT C GTAAGTGCGC AAACCAGCTT CAACATGACC AATTGGAATC 702 0 

TGTAAATAAA AGGCCGCCAG TGAACTAGCG AAGGTCGTAC TTGTATCCCC ATGAACTAAC 7 080 

AC C AAAT C AG GTTTTTCTGA CTCTAAAATA GCCTTCATTC CTTCCAAAAT GCCAATGGTC 7140 

ACATCAAATA AAGTTTGTTT ATCTTTCATA ATAGACAAAT CAAAATCGGG AAT AAT CC C A 72 00 

AATGTGTCCA AGACCTGATC CAACATTTGA CGGTGTTGGC CCGTAACGCA AACTAATGTT 72 60 

TCAATATTCT TACGTGTTCT TAACTCTTTG AC C AAAGG AC ACATCTTGAT GGCTTCTGGA 7320 

CGAGTTCCAA ATACTACAAC TACTTTTTTC AT AT ATT T AC TTACTCCTAA CAAATAATGA 7380 

ACGGTTCTTA AAATAAATTA GATAACGGCT AATCCATAAC ACCACCTCAG AC AT AC TTG A 744 0 
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ACAAATAGCT AATGTTACTA AACTAAAATT AT C AG AC AAG AT AAAT AT T C CTAATCCCAA 7 500 

AGTTTGGACA AT CGAAG C T A ATATAGTTGT CAT T GT AGTT TCTTTCACTT TATCAATAGC 7 5 60 

TCCTAAGACA GGCCATCCGT AAAT CAT AG A ATAAAAACTA GCAACAAAAG CGGGTAATAA 7 62 0 

GTACTTAAGA AAATCTGCTG AAACGGTATA TTTTTCACCA CCAATTATAG AAAGAATTTG 7680 

ATTTGAAAAG AATAAAACTA TCAAAACTCC AAAGATAATA GGAATAAACA TAATCCGATT 774 0 

AATACTCTTA ACCGATTGTA TATCTTTAGT ACGTATCATA TGCGGATATA AACTATTCGC 7 800 

TATAGGATTA TACAATGATT TTGCTGCTGA AAGCAGTTGC ATTGCTATCC CCCAAAAGGC 7 860 

TATCTCTTGA CTTTGTAAAT AAAAACCCGA AATGACTGTC GTAAAGACGC CAAAAATAGT 7 92 0 

AGTTGCAAAA TTGGATAAAA AATAAATAGA GGATTCCTTT AAATCTTTAA CCCAAACAGA 7980 

CAGATAAGAA AATGATAATT T AAT T C CAT A ATAATGAAGG AATCTATAAG AAACTACTGC 8040 

AGCAACTAAA TTCCCAATTC CTTCCAATAT AGGAATCCAT AAAATAGAAG AATCATCTTT 8100 

TACTACAATA AATGTCAAAA TTGTAATGAT AGTTTTAGAA ATAATATAAG GAATTGCAAC 8160 

TGCATGCATC TTTTCAATTC CACGAAATAA AAAGTCAAAG ATAAAAATAT TGGTCACTGT 8220 

AGCTAACAAA TAAAAAACTG AAAAAAGAAT ATTCTCTCTC AT T ATTGGG A TTTGCCACAT 82 80 

CAATATGGTG TAAATTAGAA TCGAAATGAT AGATAAAAAT ATTTTTTCAA CTAGAGTATC 8340 

TCCAACTATC CTTCCAATCT TTGAGGGAGT AGTACAAGCA TTTACAATAT TTTTTGTAGC 8400 

TGATATCATG AAACCAAAAT CAATCACCAG TTGAACATAA GCTATTAACG CTTTAACATA 8460 

AATAACCATT CCATACGCGT CTAGCGAAAG CACCCTTGTC AAATACGGGA GT GT T AAT AA 852 0 

AGGAAATAGT AATTTAACAA T ATT C AG AAT ATAGAGAGAA CTTGTATTTT T TAT AAAT G A 8580 

AATTCTATCA ACTTTCACGA ACTAGTCCTT CCAAAAAAAG ATCTAAATAG TCCAAACTAC 8 640 

TTCTCGCTTT CAACACCAAT TCTGAAGGTA TTGTT AT CGG TTTTAGATGA AAAGTTTCAA 87 00 

GTTTCTTTAC AATACTATTA AC AC T TG AAT CAAATAAAGA TTCACAACGT TGTAACTCTC 87 60 

CAATTGCTCC ATAATAACGT GCTGTTTTTT CTGGATGGCA TGCAATGGCA AT C AC AG AT l 1 8820 

TATTAAAACA TGTTGCCACT ACCCCAACAT GTAATTTACA AGTTAAAACC ACATCTACCA 8880 

TTTTCAACAA TGATGTCATT TCTGCAGGAG AAT G AT AC TT GAATTGAAAA CAATCCTCAG 8940 

TTCTAACTAA TTTTCTAAAT TCCTGATAAT AAGCATCTTC ATAAGGTAGA ATGGAATCCG 9000 

AAGTTACTAC AACATAATAG TTAGGATTGT TTTCTAGAAA AAGACTAATT GATTCCGCAA 9060 

ATTTTTCAAG AGCTTTTTTG GAATGATTAT AGTGAACAAG AATTATCTTC TTATCTTTAG 912 0 

CTTCTCTTTT CAATTGACAC AGCTGCTCTG TTTTTTCTTC TCTTAATTTA CTTGAAATAA 9180 

TT AAAT C AAA GGTTTCATGC ACTGGAGCCG AAGGCGACAA ATGCTTCAAA GAATCAAATG 9240 
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ATTCTCGATC 


ACGAACTGTA 


ATAAATTGAG 


CAT G AT T AAT 


AATTCTCTTT 


AT AC CAT AAT 


9300 


TCATCAAAGA 


ATCGTTATTA 


GGCCCTGCAC 


CAATACCTAA 


TACTCCTATA 


GGCTTTTTAA 


9360 


AATATGAAGC 


CCAAATTCCC 


AAAGGTAAAA 


ATCGTTTAAA 


TTGGATTAAA 


TTATCACGAA 


9420 


AACGTGCATT 


ATGCCCTTCC 


CCAAAATATC 


CTCCCGGGAT 


ATACAAAATA 


GCATCTGCTT 


9480 


GTTTTTTAGT 


AAAACTTTGT 


TTTTGGCGAT 


ATTCTTTCAA 


GTACATTTGA 


AAGAAATCTG 


9540 


ATGGATTATA 


AAAAGAAACT 


TCATATCCTT 


TAGATTCTAA 


TAAATCATAG 


ACAATCTCAC 


9600 


CGTAAAGATA 


ATCACCGTAA 


TTACTTGAAC 


CATAATCCGT 


TGCACCATGT 


AACATAATTT 


9660 


TTTTCACCAC 


TATTTTTTCA 


ACCTCCTAAA 


AATAAATATC 


ATAATCAAAC 


TATACATAAT 


9720 


AGGACGATAA 


ACATCTATTG 


AACTACTTCT 


CACTAAAAGC 


AATAGTTGAG 


AAATTACCGA 


9780 


AAAATAAATA 


ACTTTTGAGA 


TTTTACTTGT 


TTGAAAAGCT 


CTGAAATTTA 


ATCGCCATCC 


9840 


ACTAAATATT 


CCCAAAACAA 


AACTCCAAAA 


AACACCACCA 


TAGTAACCAA 


AGTTCCAAAA 


9900 


TAATTCTTCC 


ACAAAAGAAG 


AGCCTACAGG 


T AAC C C C AAA 


AATTTATTAA 


T AAC AAC C GT 


9960 


CGCTGATGCT 


TTATCAAAAA 


AATCACCAAC 


TAACCATCCA 


ATAGGAAAAA 


TTGATAGGAT 


10020 


AGTGCGTAGA 


AATGTCATCC 


CATATTCATA 


TGGAATGCTA 


CTAGGCACAA 


C AGTT AC AG C 


10080 


AGAAGCTACT 


GTTAGGCTGG 


TCAGTCCCGA 


CTCTGAAAAT 


ACTTCCCCTA 


GTATATTCTT 


10140 


TACAAAATCT 


AATGAAGAAA 


AGG AAT C AAA 


TAAGTATATA 


CCTATAGTAT 


TCAAGTCGAA 


10200 


ACGGTGCCCC 


CTAATAACAA 


CTAATACATT 


TAATAGAAAT 


ACAGTTACTA 


TTAAAAATAC 


10260 


AAGTACTCTT 


TTCTTCGAAA 


AAGTAATCCC 


TAAAGATTGT 


GTGTATACTA 


AAACCAACGC 


10320 


CAAGATTGAA 


AACACCTGGA 


TTTTACGACT 


TCCTGTTAGG 


AT CAT T ATC A 


AAATTAGGTA 


10380 


AAACAACATT 


ACCCAAAAAA 


TAGTACGCTT 


TATAACTCGG 


GACAGCTTAT 


CTGAATAAAA 


10440 


CAAGGAGAAC 


ACACCAGGAA 


G C AT AAGT AC 


TCCTAAATCA 


TC T AT T ATT C 


CTGAACTAGC 


10500 


TGCCTCTGAA 


TATGCTGAAT 


AGCTATTCGC 


CGCTCTAACT 


GCTAGTACTG 


TTTTAGAATC 


10560 


AGTTATTACC 


CTAGAAATAA 


AGCCCACTCC 


TGTTAAAATC 


CTACCCGCAT 


TGTACAAAAT 


10620 


TTTCTCTTCA 


TTTTCCTGAT 


AATTTTGTAC 


TTCTGAATGA 


TAATGTACCT 


TTCCATCACT 


10680 


ATAAAAAAAT 


AAATAGCCTA 


CAGAATAACA 


AAACAAAATC 


CAAATTATAA 


AAAT AT AT G A 


10740 


ATGAAATAAT 


TCTTCATTAT 


TATAGAAGTT 


ACTAGGGCTC 


CACAGCAGAG 


TTGTTTGAAA 


10800 


CCCCATATAC 


TCATTGAAAA 


TTAATCCAAA 


CATAAAAAAA 


TAAGATAAAA 


TCAGATACCA 


10860 


TACAGAAAAA 


T CAT AT AT AC 


TAACTTTTTG 


TAAAATAAAA 


CCAGTAATTT 


GAAAAATAAT 


10920 


TAGAAAGCAA 


ACCCATATAA 


AT AT AG AC GG 


AACATAATTA 


GATATAAGAA 


AACCATTATT 


10980 
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CCAATTATCG 


AGAGTCCAGA 


ACAAGTAACA 


GAAAGCAAAT 


ATAAAACTTA 


ATGT C AC TAG 


11040 


TGTCACTCTA 


CAAATATACT 


TTGTCTGCAT 


CTATATCTCC 


TTTATTACAC 


ACATTTCTTG 


11100 


ATAACGATTC 


AATAATTTAC 


TAGCTTGATA 


ACAAATATCA 


TAGAGTCCAT 


CTGTCATACT 


11160 


GTTATTTATT 


TCAAAACGAT 


TGCATTCCTC 


AGATGTTAAA 


G AC AG T AC TT 


TATCTTTCCA 


11220 


TAGCAACACA 


GACTCTTCGT 


TGATAGGTAA 


GTAACTAATG 


TTTTTGGTCA 


CATCTACTTC 


11280 


TTGCGTCACT 


GTATCTGACG 


ATAAAATTTG 


TAATCCCGAT 


GCCTGAGCCT 


CTAC T AG AG A 


11340 


AACAGGCAAC 


CCCTCATATT 


TAGACGGAAG 


CAAAAAAACA 


TCCATCGCAG 


ATAATAAATC 


11400 


AGAAATATCA 


GTCCTTCTCC 


CTAAAAATAG 


CACATATGGG 


GTC AG AT T T A 


GTTCTAAAGC 


11460 


TTTCTGTTTT 


AATTTCTGCT 


CATCCTCACC 


ATTACCAACT 


AGGAGTAAAA 


TAACATTTGG 


11520 


TTTGATTAAA 


ATGAGTTCTT 


TTAAAACGTT 


AAATAAATAA 


CTTTGGTTTT 


TTTGATCTGA 


11580 


TAGGCGAGCT 


ATATTTCCTA 


ATACGAACTT 


ATTTGACACA 


TCTAATTCTC 


TACGACATTT 


11640 


TTCTCTAACA 


TCTGACAAAA 


ATTGATACTT 


TTTCAAATCA 


ATTG CAT TAA 


AAATAATTTC 


11700 


AATTTTTCCG 


TCTTTATACG 


CTTTCTCTCC 


ATATAACCAC 


TTAGCCGAAT 


CTTCCCCACA 


11760 


TGCAAACCAA 


TGAGTTGCTA 


AGATTTTTAC 


CAAAATTGTT 


ACTAATTTAC 


GCAATACTTT 


11820 


TTGAAAACTG 


TTTTCTGTTA 


CAT AAG C CAT 


ATGACTATGA 


ATAATTCTAA 


TTTTACAACC 


11880 


AATTATTTTA 


GAT AAG AT C A 


GACCAATTGC 


AGATTTATAG 


CCATGGCAAT 


G AACT AT AT C 


11940 


ATAATCTCCT 


TTCTTTATTA 


TTCTAGCAAG 


AGAGAGAAAC 


TGATGTAGAG 


GCTTTTTCCT 


12000 


TAATAGAGGC 


ACATGATAAA 


CCTTTGCACC 


CAATTCTTTC 


ATTTTATCCT 


CTAAAAATCC 


12060 


TTGTTCTTTT 


CCAGGCACAA 


TAAAATCAAA 


TTGAATTTTT 


TTTCTATCAA 


TGTGAGAATA 


12120 


ATAGTTGAAT 


AG AAAACTT T 


CTACTCCACC 


ACTATCTAGT 


GTTGTAAATA 


GATGTAATAC 


12180 


TTTAATCATT 


CTTCTTCCTT 


AAGCTTAAGA 


TTCGCTTCTC 


TAATTCTATT 


TCTGTTTTTT 


12240 


GTTTTTCTAA 


ACTAATTCTG 


TCCATGAAGT 


TATCACAATT 


CTTAATTAGC 


TGTTTCCTGT 


12300 


CAAGGTTTTG 


AATATACAAA 


GCCAAACAAT 


CTTTTTCCGA 


TTCATCCTTC 


ATAGGTAAAA 


12360 


CGAAACCAAA 


AC C ATT CTCT 


ATTG AC AC TT 


TTTCCATATA 


AGTATCTTCA 


CAAACTAAAA 


12420 


TAGGTTTATA 


CAACAATGCA 


GCAAAGTAGA 


GTTTATTAGA 


CAAAGCATAG 


TCTAGTAAGG 


12480 


GAGTGTGATT 


CCCGTATAAA 


TTCAAAACAA 


CATCTGTATT 


CTTATAAAAA 


GACATGGTAT 


12540 


CTTTAGGCTG 


GAATGTGTCC 


ACCAAGTTAA 


CATTGCTGAT 


ATTTTTTTCT 


TGACAAAATT 


12600 


CCCTTAATTC 


TCCTGCATTA 


GTACCTATAA 


AATTCAACTG 


AAATCGACTG 


TCATTTGCAA 


12660 


AAAAATC G AT 


TATTTTTTTA 


TTTTGTTCTT 


GAAAACGAAT 


TAAACCAATG 


TAGGAAAGTT 


12720 


GAATTGGAAA 


CGTACTATTA 


TTTTTTAACT 


GCTTTACCTC 


GTTTAATTCT 


ATCATATTGG 


12780 
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GTAGGTTATG 


GGTAGTAAAA 


TACTCTCCCA 


TTGGTAAAAA 


AAATTTATAG 


CCGTCTGAAG 


12840 


AAACGATATT 


CATTAAAGAA 


TTTTTCACCA 


ATTGTTTCTG 


AACCAAACGA 


TAAACCAAAA 


12900 


ATTTTTCATA 


ACTGTAATCA 


C G AAT AT CAT 


AAATATATCT 


ATTTTTAAAT 


GAAAAGAGAA 


12960 


G AAAATC T AC 


TAAAATGAAA 


GACACAATAC 


TATGTAACGG 


CAATATCATA 


T CAT AAT CAT 


13020 


TTTCTTTTAG 


CTTCTTTTTA 


ATTTCTTTTC 


TG AAT T T T AC 


AT AAC CT AAT 


ATCTTACTTA 


13080 


ATTTTCCTTT 


AC C AG AAAAA 


GAAATACGAT 


AGTAGTTTTG 


TTTTGTAATA 


ATCTCGTTAA 


13140 


TATTCTTATC 


C C AAT AT AT A 


ACATCGTAAC 


TAATAGACAG 


TTTCTTCAAT 


AATTCTTTAT 


13200 


AAAAATTGAA 


GTAAGGAGTT 


AGATATATAT 


TATCAGATAG 


TATAAACAGT 


ACTCTCATTA 


13260 


AATTATTCTT 


TCTTACTTTC 


CCTCTCTAAA 


CATGTCTCCA 


GTTCGAGCAT 


AAACTGCTCT 


13320 


TTTGAAAAGT 


GATTTTCATA 


GTAACAACGA 


GCTTTCTTTC 


CTAACTCTCT 


TTGTCTCTTA 


13380 


ATAGATAACA 


TACTAAATTT 


ACAAATATTT 


TTTGCCAATT 


GTTTTACATC 


TCGTTCGGGA 


13440 


C T AAC AT AT C 


CACAATTTGC 


TTCTTCTACA 


ATTATTTTAG 


CATCTCCTGA 


AATTGCACCT 


13500 


ATAATTGGTT 


TGCCTGCCGC 


CATATAAGAk 


TGTACCTTCC 


CAGGTATAGT 


ACGAGAAACT 


13560 


ATCGAGTCTC 


CTATTAAAGA 


AACTAACATA 


GCATCTGATT 


TTTTATAGAA 


GGATGGCATT 


13620 


TCCTCCAAAG 


AACGTCTTCC 


ATAGAAGGAA 


ATATTCTTTA 


ACTCCAATTC 


ATGAGCTAAT 


13680 


GCTTTCATGC 


TTAACAATTC 


CGTACCATCT 


CCAACAAAAT 


GAAAATGAAT 


TTTCTTGGGT 


13740 


AAATTGGTAT 


TCTTCTCTAT 


CAAACTGGCA 


GCTTTCAAAA 


TAGTTTCCAA 


ATTTTGTGCT 


13800 


TTGCCAATAT 


T AC C AGC AAA 


AGTTAGGTCA 


ACACTTTCTT 


TATTAACTAT 


AG AT TC AT C A 


13860 


GGGATAAAAA 


GATCTTCTGC 


ATATTGTGGC 


AAATATGTAA 


TCTTTTGTTC 


GGATATGTCA 


13920 


AATTGCTTCA 


CAAAATAATT 


TTTAAATGAT 


GGACTAGTGA 


CAAATATATA 


ATCACTAGCT 


13980 


CGGTAAACTT 


TTTTTGAGAT 


AAATTTAAAC 


AGCTTGAAAA 


TCAAGCCATC 


TTGTTTCACT 


14040 


CCACCTACGG 


TTAAACTATC 


TGGCCAAACA 


T C CAT AC AAT 


ATAGAAACAT 


CGGTTTCTTA 


14100 


TATTTTTTTT 


TATAAGCCAT 


ACCAGCCCAT 


GCCATCATAA 


CTGGAGACAA 


TTGGTTAACG 


14160 


AATACACAGT 


CAAAATTCGA 


TCCATCTTTC 


GTTTTATACC 


TCCCCAATAA 


AAC TCCT AAA 


14220 


GTAGAACTAA 


TTGCAAAGCT 


AAAATAATTC 


AACAATCGAA 


ATACAACACT 


TTTTTTTCTA 


14280 


GGGATTGTAT 


AAGAACGATA 


TATCGTAACA 


CCTTCTATAA 


TCTCACGTCT 


TTTTTTATTA 


14340 


TGACGATAAT 


C T GC AT AT AT 


CTTCCCTTCA 


GGGTAATTAG 


GAATCCCAGC 


CAAAACAGAG 


14400 


ACTTCATGCC 


CTTTTCGAAC 


TAAATCTTCA 


CAAATATCTG 


ACAACCTGAA 


TGGTTCTGGC 


14460 


TTATAATGTT 


GGCAAACAAA 


TAGTATTTTC 


ATTGTCCAAT 


TTAACTTTCT 


TTCTTACCAC 


14520 
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TACCCTCTAC 
TATCCATTAT 
CAGACAAAGT 
TTGCTCCATA 
GACCTACAAT 
ATGTTTTTCG 
GAGGCGCCAC 
ATTCTTTATG 
TAATCGCAAT 
ATAATATATC 
TATTTCCATT 
C AG AAACT C A 
CTACGTCAGA 
GTTCTCAATA 
TTTGAAGACA 
ATGCTCTGTA 
TTTTTACCAT 
TTATTTAAAA 
TGGATATCAC 
TGAGTGATAA 
TCAAAATATT 
GTAGGGTTTG 
CATAAACCGT 
TCACGAGATT 
GTTTTATAGC 
TCCCCAGGGT 
TTTGTACACA 
GCTATTTCTA 
CCCAAAAGTG 
AAAAGTTCAA 



AATACCTTTT 
CAAAGACAGA 
ATCACGCCCG 
CTTATCTCTC 
ACTCATATTA 
CAAGAAAGCC 
ATTTTTAGGT 
AATACCAAAG 
TGCAATTATC 
ACCTAATCGT 
TTTCATTCTA 
AT AT AT AT T T 
TAAAACTTTT 
ATG AAT TAG A 
TATCGGTTTC 
AG AAT G C C AG 
AAT TT C CAT A 
CAACTCCTAG 
GTTTATTCGC 
TTGCCGCATC 
TACGCAATGT 
GTGATACAGA 
GAGATAAATC 
TAAAAACTCC 
CTGCACGCGC 
TAACAGAAGT 
AGGCATTGTA 
ATGTCGGCAT 
TCATCTGCAA 
GATGAAGAAC 



CGTTTCAGTA 
TGTTTAACAT 
TTAATTTGTG 
TCTGCAATCA 
CCAACAAGAA 
CCTACTTTTG 
GCATCTATTT 
CGTTTTTGCT 
AT AAAAAC C G 
T T T ATT AT AC 
TTTCCATTTG 
GGTCACTCAA 
CTCCATCTAA 
AGGTCCAGTT 
TGAAATTGCA 
AG T T AT AT TG 
GGAACCGTAA 
GAACAGTTTC 
CTCACCTGTT 
AATAACAATT 
TTCAATCATA 
TCCCGATTGA 
AGCTGTCCCA 
TAACATAACT 
AAACGACCAT 
AACGGAAATT 
ATATTCTTCT 
CCTTCTCTCC 
TGTATTTTCG 
TATAACACTA 



356 
CGTAAGGTAT 

AGTAGCCATC 

CCCATCCAGT 

AATCTAGTTC 

TATTAAACAA 

T AAT Cy AT TG 

T C AT AG ACCT 

TAAATATAAC 

GACACAATAT 

CGTACATAAA 

ACAAATTAAA 

TGATTTTCAG 

ACAAAATTTA 

CAATTATTCT 

ATCAGTACAT 

TAGTCCCCTT 

ACTCCATACT 

CCTGTTTGTT 

GCTGTTACCA 

CCAATAGGCG 

TCATTAAAAT 

ACTACAAATA 

GATAAAAATT 

GAATTTCGAG 

GCTATATTTA 

ACTTTTAGTT 

GCCTTCTTAA 

TATTTCAACT 

ATATCTTCCG 

GTTCCAATCA 



TGTCTTAACT 
TAACTCCGTC 
TAACCCTGGC 
ATTTATACCC 
TTGTGGTAGT 
CTCTGGATTA 
AAATT T C AAA 
CGGACCTTCT 
TATTATCCCT 
CAACCTCCAA 
TCAGGCAGTA 
AAATATAATT 
TTTGTTTCAG 
TCCAAATAGA 
AAGCTAATAA 
CCATACTATA 
TATTAACCGA 
TTAATTGTTG 
AGATGGACGC 
GTGTATCAAT 
TTTTACTTTG 
AATTTTCAAT 
CTGTTAGCCC 
TATCGCCATC 
TGGAAGTAGT 
TATCTCCGCT 
TGAACTCCAG 
TACCCAAGTT 
GACGTTTCAC 
CCCCTGCCAA 



ATACATCTAA 
TTCATCTCAA 
AAGATATCAT 
GCTGGTCTAG 
TCATCCAAAG 
TATAAGTTTC 
ATATAGAAGT 
GAATCAAGTT 
ATTAAAGATA 
CTATAAATTC 
CATGCAACTA 
CTTTTATCCT 
TAATATATGA 
CCGAATATTA 
ACTGATAAGT 
TTCATTTTAT 
GAT AT C C AAT 
TTTCGCTTTT 
AT C AC AC TT T 
AAT G AT AT AA 
TAACAAGGCT 
ATTTGTATCA 
TGTAATTTTT 
GATCAAAAGA 
TGTTTTTCCT 
CAACTGTATA 
TTTTTTTTGT 
TGGCACAACT 
ACGAGTATCC 
AAAACCAATT 



14580 
14640 
14700 
14760 
14820 
14880 
14940 
15000 
15060 
15120 
15180 
15240 
15300 
15360 
15420 
15480 
15540 
15600 
15660 
15720 
15780 
15840 
15900 
.15960 
16020 
16080 
16140 
16200 
16260 
16320 
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AGTGTATTGC GTTTAATATT TGGCGAAGAC GGGGATATCG CCGGCCTTGC CTCCTCCAGT 163 80 

GTTGTCACGT C AG AAAC AC G AGTAATACTG ATAATTTTTT GAGCAGCTAC TTCTCTCAAA 16440 

GAGTTAGCGA TACGGCTTGC CTCTTCAGGA ACTCGATCAT TAACTGAAAT AGAGACAATA 16500 

CGGGTATCAA CTGGTACTGT CACTTTAATT TTATTAGCCA AACCTTTTGG CGTCAAATCT 165 60 

AGTTTCAAAT CAGAAACAAC TTCCTCCAAA ACATCCTGCG AAAGGATAAT CTCACGGTAG 16 620 

TCTTTTACCA GATAAGTTCC TGCCTGCAAA TCCTGATTTG TCAACCCCGG CTTGTCTCCT 16 680 

TGATTGCGAT TCACTACGTA AATTCGCGTG GTACTCGTAT ATTCTGGCTT AACAATAAAA 16740 

GTGCTATATG CAAAAGCCCC CGCACCTGTC ACAAGTGCCA CTATTAAAAT CATTAGCTTG 16800 

CGTTTCCACA AGCTTTTAAC TAATTGAAAT AC AT C G ATTT CTATCGTATT TTGTTCTTTC 16860 

ATCATTTCTC CTAAATTAGT TGATCCATTA CAATTTTTCG AGGATTGTCT ATAAAAAGTT 16920 

CCTGAGCCTT CGCTTCTCCG TATTTTTGGG TAACAAGGTC ATATGCTTCT GCCATATGAG 16980 

GAGGTCTACC GTCTAGATTG TGCATATCAC TTGCAATGAC AT G AAC C AAA TCCTGCTCTA 17 04 0 

AAAAAT AC T G AGCTCTTTTT TTCATGAATT T AT AAC GT T C GCCAAAAAGT TTGGGTTTGA 17100 

GGACATGTGA ACTATTTACT TGCGTGTAAC AGCCCATATC GATCAGTTCT CGAACGCGTT 17160 

TTTCATTATT TTCAAGAGCA TCATAGCGCT CAATGTGGGC AATGACTGGA GTAATTCCCA 1722 0 

ACATCAAGAT CTTGCTCAAG GCGCTATGAA TATCGCGATA AGGAGTGTTC ATACTAAACT 172 8 0 

CTATCAAGGC ATAACGACTA TCATTGAGGG TCGGAATCCG CTTTTTTTCC AGCTTATCCA 17340 

GAACATCTGG TGTGTAATAA ATTTCAGCCC CGTAAGCAAT GACCAAGTCA CTCGCCACTT 174 00 

CCTTAGCTAT TTCCCGAACC TGAAGAAAGT TTTCTGCTAT CTTCTCTTCC GGAGTTTCAA 174 6 0 

ACATGCCCTT GCGACGGTGA GAGGTAGAAA CAATGGTTCG CACCCCCTGT CTGTAGGATT 1752 0 

CTGCCAAGAG AGCCTTGCTT TCCTCTCTTG ACTTGGGACC GTCATCTACA TCAAAAACGA 17 580 

TATGCGAATG GATGTCTATC ATTTCATCTA CCCTCCATCA CATCCTGTAT AGCTGCTTTA 17 64 0 

ACTACAGCTA AACTACTATC ATCTATTTCC ATCACATAGA GGTTACTGTC TGGCATTGCA 17 700 

TAAGAAGGAA GATCCATCCG ACCTGTCCCT TTTAAATCTT GAGAATTTAC TTTATAATTC 177 60 

CCTCCACTTT CTAACTGAGC ATTGACCAAA TTTATCATGG TCTCAAGTGG CATATTTGTT 17 82 0 

TGGATAGAAT CTTGCAAGCT ATT AATG AT C GTACTATAAT TTTTCAGCAC TTCGGTTGAC 17880 

GTTAATTTTT GAAGGATAGC CACAATCACC TTTTGTTGAT GGCGCCCGCG GTCACGATCG 17 94 0 

CCATCTGCTA GGGAGTAGCG CTCACGAACA AAACCGAGAG CCTGTTCTGA ATCAAGATGA 18000 

ACATTGCCTG CAGGGTAATA CTTTCCATTC GTATGGGCAG TAAATTCTTG ATCATTATAA 180 60 
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ACATCAATTC CACCCAACAA ATCAATCAAT TTCAAAAACG AAGTGAAGTT CAATCGCACA 1812 0 

TAGTAATTGA TATCCACTCC ATAGAGATTT TCTAAGGTGT GAATGGACGA ATCAACTCCA 18180 

TAAATGCCCG CATGAGTCAA TTTATCTTTT TGATTATTTC CACCATCTGC GATTGGTACA 18240 

TAGGCATCAC GTGGCGTTGT GGTCAAGAGG ATTTTCTTGG TATCTCGATT GACAGTCATC 183 00 

AGGATGTTGA CATCTGATCG CGACACCGAA CTAATAGGAC CATAGGTGTC AATTCCACTA 183 60 

ACATAGATAT TGAAAGACTG ACTCTTAGAC GTCTTAGGAG CTTCTACTTT TTTAGTGAAT 1842 0 

CCCTTAGTAT AAATCTTTTT TATCTTCGAT GCGTAGTCTG G AT ACT CTG A CTCGATGATG 184 8 0 

TTTTCAAAGA CACTATTTAG GACAATGGCC TTAGTCTCCC CTGCAATCAA ACTCTTGTAA 18540 

GCTGCCAAGT AAGACGAACT CTGGTTGACC GTCAAATCGG TATTCTGACT TGACTTGATA 18600 

TCAGCTAGTA ATTTCTGAAT ATTTTCATTA TTAGTCCCAG TCGGTGCTGT CACACTCGTC 18660 

AGTTGCGTAA CATTTTCGAT CTCACTATCT GCTAAAACAG CGACACTGAT TGAATATTCT 18720 

GAGTAATTAG AAGTCGCATT TAAACGATTG GTCAGTCCAA CAAACTGCTG TACTGCAAAG 187 80 

AGCGACACAG AGCTGACAAG GATAGAGAAC ACCAACAGAA AAATAGTAAA CTTTTCAGCT 18840 

TTTTTATAGA TAATCAAGAG TAGCCCTACC AAGGCAACTA GTAGGACTAA CGCAGTTACC 18900 

ACTAGATTAA GATATCTAAA AG C AAGG AT A TTGTACTTAA AGATTAAGAA CAATAAAAAA 18 9 60 

CAAACTAACA ATAAATAAAT AGTCAGCAAA AC TAT ATT AA CACTTCGCTT CACTTTCTGT 19020 

G AACGTG AT T TTTTAAAACG TCTACTCATG ATTAATACCT ATACATTGAA CAT T AT ACG A 190 80 

TT AT AT C AC T TTTTTACGGT AATGTCTACA CCTTTATTTT TACTATCTGC ATCTTTAAGT 1914 0 

AT CTT AGT AG ACTTCCCGCG AAACAAAAAT ATAGTAAAAT GAAATAAGAA CAGAACAAAT 19200 

CGTTCAGGAC AGTCAAATCG ATTTCTAACA ATGTTTTAGA AGCAGAGGTG 19250 
(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21706 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNES S : doubl e 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 36: 

AAAGT T G AAA GACTGCTAGC TGTTTTTGAT ACCAATCGTT TCCAACTACA GAG C AAAC AG 60 

TATACAAAGT TTGTTTTTGG ATGTAAGCTT CTTGATGGAC AATTCCAAGA AAATCAAGAA 12 0 

ATTGCTG AC C TTCAATTTTT TGCCATTGAC CAACTGCCGA ACTTATCTGA AAAACG C ATT 180 

ACCAAGGAGC AAAT AG AG CT TCTTTGGCAG GTTTATCAAG GTCATAGGGG GCAATATCTT 24 0 
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ATGATTATCG 


TATTTCTAAA 


TCCATTTTTA 


ACAACTAGCA 


TGGTATAATA 


300 


/\ 1 /\ 1 ^L./\V3LjA 


AAATTTTGAA 


T TAT GAGG AA 


GACTAGATGA 


ATTTATGGGA 


TATTTTCTTT 


360 




CAACCGAGCC 


GC CC AAAT T T 


GACCTTTTTT 


GGTATGTTAG 


CCTATTTACG 


420 


CTCTTAGCCT 


TAACCTTTTA 


T AC AG C C C AT 


CGCTATCGTG 


AAAAGAAGGT 


TTACCAACGA 


480 


11111 LLAM 


TCTTGCAGAC 


TGTTCAGTTA 


ATCCTTCTTT 


ATGGTTGGTA 


CTGGGTCAAT 


540 


L.ATATGCCAC 


TGTCAGAAAG 


CCTACCCTTT 


TACCATTGCC 


GTATGGCTAT 


GTTTGTGGTA 


600 


CTCTTGCTTC 


CTGGTCAATC 


CAAATATAAA 


CAATACTTTG 


CATTATTGGG 


AACATTTGGG 


660 


AC AT TAG C AG 


CCTTTGTTTA 


TCCAGTGCCA 


GATGCTTACC 


CTTTTCCACA 


TATCACCATT 


720 


CTATCCTTTA 


TCTTTGGTCA 


TTTAGCACTC 


TTGGGGAACT 


CTCTAGTTTA 


TCTATTGAGA 


780 


CAGTATAATG 


CGCGATTGCT 


GGATGTGAAG 


GG AATTTTTC 


TCATGACCTT 


TGCCCTAAAT 


840 


GCCTTGATTT 


TTGTGGTCAA 


TTTGGTGACA 


GGTGGCGATT 


ACGGATTTTT 


GACAAAACCG 


900 


CCATTGGTTG 


GGGATCACGG 


TCTAGTAGCT 


AATTATTTAC 


TTGTTTCAAT 


TGTGCTGGTA 


960 


GCTACTATCA 


GTTTGACTAA 


G AAAAT CTT A 


GAATTCTTTT 


TAGCTCAAGA 


AGCAGAAAAA 


1020 


ATGATTGCAA 


AGGAAGCTTA 


AC AC AG AGC T 


TTCTTTTTTG 


CTCTTAGAGA 


GTTTTTACAA 


1080 


GCAGCTTATA 


AAATAAGAAT 


TTCTGAATAG 


AC AAACT C AA 


AAAATGGCTG 


GGAAATTTAG 


1140 


GAAAAAAGCA 


AG C ACG AT T A 


AATTTTTTGT 


GTTATAATAT 


TTTGTGAATA 


GCTATGCCTA 


1200 


TGTTTAGCTA 


TGGAATAATA 


CGAAGTGCGA 


AACTTGGAAG 


ATAGAGAGGA 


AGCGATGTAA 


1260 


TGGCTAGAGA 


AGGCTTTTTT 


ACAGGTCTAG 


ATATTGGAAC 


AAGCTCTGTC 


AAGGTGCTTG 


1320 


TGGCCGAGCA 


GAGAAATGGT 


GAATTAAATG 


TAATTGGCGT 


GAGTAATGCC 


AAAAGTAAAG 


1380 


bibl AAAGGA 


TGGAATTATT 


GTTGATATTG 


ATGCAGCAGC 


AACTGCTATC 


AAGTCAGCCA 


1440 


TTITTT 1 /^ A A/-'/"' 

111 LLLMGC 


GGAAGAAAAG 


GCAGGCATTT 


CG ATT AAAT C 


AGTGAATGTC 


GGCTTGCCTG 


1500 


GTAATCTTTT 


GCAGGTAGAA 


CCAACTCAGG 


GGATGATTCC 


AGTAACATCT 


GATACTAAGG 


1560 


AAATTACGGA 


TCAAGATGTT 


GAAAATGTTG 


T C AAAT C AG C 


TTTGACAAAG 


AGTATGACAC 


1620 


CTGACCGTGA 


AGTCATTACC 


TTTATTCCTG 


AAGAATTTAT 


TGTGGATGGT 


TTCCAAGGGA 


1680 


TTCGTGACCC 


ACGTGGCATG 


ATGGGGGTTC 


GCCTTGAAAT 


GCGTGGTTTG 


CTTTATACAG 


1740 


GACCTCGTAC 


TATCTTGCAC 


AATTTGCGTA 


AGACGGTTGA 


GCGTGCAGGT 


GTTCAGGTTG 


1800 


AAAATGTTAT 


CATTTCACCA 


CTAGCAATGG 


TTCAGTCTGT 


TTTGAACGAA 


GGGGAACGTG 


1860 


AATTTGGTGC 


TACAGTGATT 


GATATGGGGG 


CAGGTCAAAC 


GACTGTCGCT 


ACAATCCGTA 


1920 


ATCAAGAACT 


CCAGTTCACA 


CATATTCTCC 


AAGAAGGTGG 


AGATTATGTA 


ACTAAAGATA 


1980 
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TCTCCAAGGT 


TTTGAAAACC 


TCTCGCAAAT 


TAGCGGAAGG 


CTTGAAACTG 


AATTACGGGG 


2040 


AAGCCTATCC 


GCCTCTTGCA 


AGCAAAGAAA 


CCTTCCAAGT 


AGAGGTTATT 


G G AG AAGT AG 


2100 


AAGCAGTCGA 


AGTGACGGAA 


GCCTACTTGT 


CAGAAATTAT 


TTCTGCACGA 


ATCAAGCACA 


2160 


TCCTTGAACA 


AATCAAGCAA 


GAATTAGATA 


GAAGGCGTCT 


ATTGGACCTC 


CCTGGTGGTA 


2220 


TTGTCTTAAT 


CGGTGGGAAT 


GCCATTTTAC 


CAGGTATGGT 


TGAGCTTGCT 


CAGGAAGTCT 


2280 


TTGGCGTCCG 


TGTCAAGCTT 


TATGTTCCAA 


ATCAAGTTGG 


TATCCGTAAT 


CCAGCCTTTG 


2340 


CGCATGTGAT 


TAGTTTATCA 


GAATTTGCGG 


GTCAATTAAC 


AGAAGTTAAT 


CTTTTGGCTC 


2400 


AGGGAGCGAT 


AAAAGGTGAG 


AATGACTTAA 


GTCATCAGCC 


AATTAGTTTT 


GGTGGGATGC 


2460 


TGCAAAAAAC 


AGCTCAGTTT 


GTACAATCAA 


CGCCTGTTCA 


ACCAGCTCCT 


GCTCCAGAAG 


2520 


TAG AG C CGGT 


GGCGCCTACA 


GAACCAATGG 


CGGATTTCCA 


ACAAGCTTCA 


CAAAATAAAC 


2580 


CGAAATTAGC 


AGATCGTTTC 


CGTGGATTGA 


TCGGAAGCAT 


GTTTGACGAA 


TAAAGAGGAA 


2640 


AAATAAATTA 


TGACATTTTC 


ATTT GAT AC A 


GCTGCTGCTC 


AAGGGGCAGT 


GAT T AAAGT A 


2700 


ATTGGTGTCG 


GTGGAGGTGG 


TGGCAATGCC 


ATCAACCGTA 


TGGTCGACGA 


AGGTGTTACA 


2760 


GGCGTAGAAT 


TTATCGCAGC 


AAACACAGAT 


GTACAAGCAT 


TGAGTAGTAC 


AAAAGCTGAG 


2820 


ACTGTTATTC 


AGTTGGGACC 


TAAATTGACT 


CGTGGTTTGG 


GTGCAGGAGG 


TCAACCTGAG 


2880 


GTTGGTCGTA 


AAGCCGCTGA 


AGAAAGCGAA 


GAAACACTGA 


CGGAAGCTAT 


TAGTGGTGCC 


2940 


GATATGGTCT 


TCATCACTGC 


TGGTATGGGA 


GGAGGCTCTG 


GAACTGGAGC 


TGCTCCTGTT 


3000 


ATTGCTCGTA 


TCGCCAAAGA 


TTTAGGTGCG 


CTTACAGTTG 


GTGTTGTAAC 


ACGTCCCTTT 


3060 


GGTTTTGAAG 


GAAGTAAGCG 


TGGACAATTT 


GCTGTAGAAG 


GAATCAATCA 


ACTTCGTGAG 


3120 


CATGTAGACA 


CTCTATTGAT 


TATCTCAAAC 


AACAATTTGC 


TTGAAATTGT 


TGATAAGAAA 


3180 


ACACCGCTTT 


TGGAGGCTCT 


TAGCGAAGCG 


GATAACGTTC 


TTCGTCAAGG 


TGTTCAAGGG 


3240 


ATTACCGATT 


TGATTACCAA 


TCCAGGATTG 


ATTAACCTTG 


ACTTTGCCGA 


TGTGAAAACG 


3300 


GTAATGGCAA 


ACAAAGGGAA 


TGCTCTTATG 


GGTATTGGTA 


TCGGTAGTGG 


AGAAGAACGT 


3360 


GTGGTAGAAG 


CGGCACGTAA 


GGCAATCTAT 


TCACCACTTC 


TTGAAACAAC 


TATTGACGGT 


3420 


GCTGAGGATG 


TTATCGTCAA 


CGTTACTGGT 


GGTCTTGACT 


TAACCTTGAT 


TGAGGCAGAA 


3480 


GAGGCTTCAC 


AAATTGTGAA 


CCAGGCAGCA 


GGTCAAGGAG 


TGAACATCTG 


GCTCGGTACT 


3540 


TCAATTGATG 


AAAGTATGCG 


TGATGAAATT 


CGTGTAACAG 


TTGTTGCAAC 


GGGTGTTCGT 


3600 


CAAGACCGCG 


TAGAAAAGGT 


TGTGGCTCCA 


CAAGCTAGAT 


CTGCTACTAA 


CTACCGTGAG 


3660 


ACAGTGAAAC 


CAGCTCATTC 


ACATGGCTTT 


GATCGTCATT 


TTGATATGGC 


AGAAACAGTT 


3720 


GAATTGCCAA 


AACAAAATCC 


ACGTCGTTTG 


GAACCAACTC 


AGGCATCTGC 


TTTTGGTGAT 


3780 
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TGGGATCTTC GCCGTGAATC GATTGTTCGT ACAACAGATT CAGTCGTTTC TCCAGTCGAG 3 840 

CGCTTTGAAG CCCCAATTTC ACAAGATGAA GATGAATTGG ATACACCTCC ATTTTTCAAA 3 9 00 

AATCGTTAAG TAAATGAATG TAAAAGAAAA TACAGAACTT GTTTTTCGAG AAGTTGCAGA 3 960 

GGCTAGTCTG AGTGCTCATC GAGAGAGTGG TTCGGTCTCT GTCATTGCAG TTACCAAGTA 4020 

TGTAGATGTA CCGACAGCGG AAGCCTTGCT TCCGCTAGGT GTCCATCATA TCGGTGAAAA 408 0 

TCGTGTAGAT AAGTTTCTGG AAAAATATGA AGCTTTAAAA GATCGAGATG TGACTTGGCA 4140 

TTTGATTGGT ACCTTGCAAA GACGTAAGGT GAAAGATGTC ATTCAATACG TT G AT T ATT T 42 00 

CCATGCATTG GACTCAGTAA AGCTAGCAGG GGAAATTCAA AAAAGAAGTG ACCGAGTCAT 4260 

CAAGTGTTTC CTTCAAGTAA ATATTTCTAA AGAAGAAAGC AAACACGGTT TTTCGAGAGA 4320 

GGAACTGCTG GAAATCTTGC CAGAGTTAGC CAGACTAGAT AAGATTGAAT ATGTTGGTTT 4380 

AATGACGATG GCACCTTTTG AGGCTAGCAG TGAGCAGTTG AAAGAGATTT TCAAGGCGGC 4440 

CCAAGATTTA CAAAGAGAAA TTCAAGAGAA ACAAATTCCA AATATGCCTA TGACCGAGTT 4 5 00 

AAGTATGGGA ATGAGTCGTG ATTATAAAGA AGCGATTCAA TTCGGTTCCA CTTTTGTTCG 45 60 

TATAGGTACA TCATTTTTTA AGTAGGAGAG AACCATGTCT TTAAAAGATA GATTCGATAG 4 620 

ATTTATAGAT TATTTTACGG AGGATGAGGA TTCAAGTCTC CCTTATGAAA AAAGAGATGA 4 680 

GCCTGTGTTT ACTTCAGTAA ATTCTTCACA GGAACCGGCT CTCCCAATGA ATCAACCTTC 47 4 0 

ACAGTCGGCT GGCACAAAAG AGAACAATAT CACCAGACTT CATGCAAGAC AACAGGAATT 4 800 

GGCAAATCAG AGTCAGCGTG CAACGGATAA GGT C ATT AT A GATGTTCGTT AT C C TAG AAA 4 8 60 

ATATGAGGAT GCAACAGAAA TTGTTGATTT ATTGGCAGGA AACGAAAGTA TCTTGATTGA 4 920 

TTTTCAGTAT ATGACAGAGG TGCAGGCTCG TCGTTGTTTG G AC T ATTTGG ATGGAGCTTG 4 980 

TCATGTTTTA GCTGGAAATT TGAAAAAGGT AGCTTCTACC ATGTATTTGT TGACACCAGT 5 040 

GAACGTTATT GTAAATGTTG AAGATATCCG TTTACCAGAT GAAGATCAAC AGGGTGAGTT 5100 

CGGTTTTGAT ATGAAGCGAA AT AG AGT AC G ATAATGATTT TTTTAATTCG TATGATTTAT 5160 

AATGCAGTGG ATATTTACTC CCTGATTTTG GTAGCCTTCG CTGTCATGTC TTGGTTTCCA 52 2 0 

GGTGCCTACG AATCCAGTTT AGGTCGTTGG ATTGTAGCGT TGGTGAAACC AGTGCTTGCT 52 80 

CCCTTGCAAC GCCTGCCTTT AC AG AT AG CG GGTCTTGATT TATCTGTTTG GGTTGCGATT 5340 

GTTTTGGTTC GATTTTTAGG AGAAAACCTA GTGCGTTTTC TGGCGATGAT AGGATGAATA 54 00 

AAGGGATTTA TCAGCATTTC TCCATAGAAG ATCGTCCATT TCTTGACAAG GGAATGGAAT 5460 

GGATAAAGAA GGTAGAAGAT AGCTATGCTC CTTTTTTAAC TCCTTTTATC AATCCTCATC 5520 
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AGGAGAAGCT ATTAAAGATT TTGGCCAAAA CCTATGGTCT TGCTTGTAGC AGTAGTGGGG 5580 

AATTCGTCTC GAGTGAGTAT GTTCGAGTTT TATTATACCC AG ATT AT T T C C AACC AG AG T 5640 

T T TC AG ATT T TGAAATATCT CTCCAGGAAA TTGTGTATTC CAATAAATTT GAACATTTAA 57 00 

CGCATGCTAA GATTTTAGGG AC AGT C AT C A ATCAATTAGG GATTGAACGG AAACTTTTTG 57 60 

GAG AT AT C C T AGTAGATGAA GAACGGGCGC AGATTATGAT TAATCAGCAG TTTCTTCTTC 582 0 

TCTTTCAAGA TGGACTAAAG AAAATTGGTC GTATACCTGT TTCGCTGGAG GAACGTCCTT 5880 

TCACCGAGAA AATAGATAAG CTAGAACAGT ATCGAGAACT GG ATT T AT CT GTGTCTAGTT 59 40 

TTCGATTAGA TGTTCTTTTA TCAAATGTTT TGAAACTATC TAGGAATCAA GCAAACCAGT 6000 

TGATTGAAAA GAAACTTGTC CAAGTAAATT ATCATGTGGT AGACAAATCA GATTACACTG 6060 

TTCAAGTTGG AGACTTGATT AGTGTGAGAA AATTTGGTCG CTTGAGATTA CTTCAAGATA 612 0 

AGGGACAAAC GAAAAAAGAG AAGAAAAAAA TAACCGTCCA GT T ATT AT T A AGTAAGTGAG 6180 

GAATAGAATG CCAATTACAT CATTAGAAAT AAAGGACAAG ACTTTTGGAA CTCGATTCAG 624 0 

AGGTTTTGAT CCAGAAGAAG TCGATGAATT TTT AG AT AT T GTGGTTCGTG ATTACGAAGA 63 00 

TCTTGTGCGT GCGAATCATG ATAAAAATTT GCGTATTAAG AGTTTAGAAG AGCGTTTGTC 63 6 0 

TTACTTTGAT GAAATAAAAG ATTCATTGAG CCAGTCTGTA TTGATTGCTC AGGATACAGC 6420 

TGAGAGAGTG AAACAGGCGG CGCATGAACG TTCAAACAAT ATCATTCATC AAGCAGAGCA 6480 

AGATGCGCAA CGCTTGTTGG AAGAAGCTAA ATATAAGGCA AACG AG AT T C TTCGTCAAGC 654 0 

AACTGATAAT GCTAAGAAAG TCGCTGTTGA AACAGAAGAA TTGAAGAACA AGAGCCGTGT 6600 

CTTCCACCAA CGTCTCAAAT CTACAATTGA G AGT C AGT TG G CT ATTGTTG AATCTTCAGA 6660 

TTGGGAAGAT ATTCTCCGTC CAACAGCTAC TTATCTTCAA ACCAGTGATG AAGCCTTTAA 672 0 

AGAAGTGGTT AGCGAAGTAC TTGGAGAACC GATTCCAGCT CCAATTGAAG AAGAACCAAT 67 8 0 

TGATATGACA CGTCAGTTCT CTCAAGCAGA AATGGCAGAA TTACAAGCTC GTATTGAGGT 6840 

AGCCGATAAA GAATTGTCTG AATTTGAAGC TC AG ATT AAA CAGGAAGTGG AAGCTCCAAC 6900 

TCCTGTAGTG AGTCCTCAAG TTGAAGAAGA GCCTCTGCTC ATCCAGTTGG CCCAATGTAT 69 60 

GAAGAACCAG AAGTAGCTCC AATGCATCCG ATAGGTCCAA CACCAGCTAC AGAAACTGTT 702 0 

GATTCAATAC CGGGATTTGA AGCACCGCAA GAATCTGTTA CAATTTTATA AGAAAT AT T C 7080 

TGAGAACAAT ATCTTATCCT TATATTTCCA GCGAGCAGGA GATGGTGTGA GTCCTGTAAT 7140 

CCCTATTGAT AAGATTATCC TCTCAAAAAC TCAAGTCTGA AGCTAGTAAG ATTTGACGTT 7200 

TCCCACGTTA CGGGATAAGA GGGAGAAAGA CTAAATCTTT TTCCGAATAA AGGTGGTACC 72 60 

ACGATTTTCG TCCTTTTTGG AAGTCGTGGT TTTTAATTTG TTATTATTTA TAAAGGAGAT 732 0 
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ACCATGAAAC TCAAAGACAC CCTTAATCTT GGGAAAACTG AATTCCCAAT GCGTGCAGGC 7 3 80 

CTTCCTACCA AAGAGCCAGT TTGGCAAAAG GAATGGGAAG ATGCAAAACT TTATCAACGT 7 440 

CGTCAAGAAT TGAACCAAGG AAAACC TC AT TTCACCTTGC ATGATGGCCC TCCATACGCT 7 500 

AACGGAAATA TCCACGTTGG ACATGCTATG AACAAGATTT CAAAAGATAT CATTGTTCGT 7 5 60 

TCTAAGTCTA TGTCAGGATT TTACGCACCA TTTATTCCTG GTTGGGATAC TCATGGTCTG 7 620 

CCAATCGAGC AAGTCTTGTC AAAACAAGGT GTCAAACGTA AAGAAATGGA CTTGGTTGAG 7 6 80 

TACTTGAAAC TTTGCCGTGA GTACGCTCTT TCTCAAGTAG ATAAACAACG TGAAGATTTT 77 4 0 

AAACGTTTGG GTGTTTCTGG TGACTGGGAA AATCCATATG TGACCTTGAC TCCTGACTAT 7 800 

GAAGCAGCTC AAATTCGTGT ATTTGGTGAG ATGGCTAATA AGGGTTATAT CTACCGTGGT 7860 

GCTAAGCCAG TTTACTGGTC ATGGTCATCT GAGTCAGCAC TTGCTGAAGC AGAGATTGAA 7 92 0 

TACCATGACT TGGTTTCAAC TTCCCTTTAC TATGCCAACA AGGTAAAAGA TGGCAAAGGA 7 980 

GTTCTAGATA CAGATACTTA TATCGTTGTC TGGACAACGA CTCCATTTAC C AT C AC AG CT 8 04 0 

TCTCGTGGTT TGACGGTTGG TGCAGATATT GATTACGTTT TGGTTCAACC TGCTGGTGAA 8100 

GCTCGTAAGT TTGTCGTTGC TGCTGAATTA TTGACTAGCT TGTCTGAGAA ATTTGGCTGG 8160 

GCTGATGTTC AAGTTTTGGA AACTTACCGT GGCCAAGAAC TCAACCACAT CGTAACAGAA 82 2 0 

CACCCATGGG AT AC AGCT GT AGAAGAGTTG GTAATTCTTG GTGACCACGT TACGACTGAC 82 80 

TCTGGTACAG GTATTGTCCA TACAGCCCCT GGTTTTGGTG AGGACGATTA CAATGTTGGT 8340 

ATTGCTAATA ATCTTGAAGT CGCAGTGACT GTTGATGAAC GTGGTATCAT GATGAAGAAT 84 0 0 

GCTGGTCCTG AATTTGAAGG TCAATTCTAT GAAAAGGTAG TTCCAACTGT TATTGAAAAA 84 60 

CTTGGTAACC TCCTTCTTGC CCAAGAAGAA ATCTCTCACT CATATCCATT TGACTGGCGT 852 0 

ACTAAGAAAC CAATCATCTG GCGTGCAGTT CCACAATGGT TTGCCTCAGT TTCTAAATTC 8580 

CGT C AAG AAA TCTTGGACGA AATTGAAAAA GTGAAATTCC ACTCAGAATG GGGTAAAGTC 8 64 0 

CGTCTTTACA ATATGATCCG TGACCGTGGT GACTGGGTTA TCTCTCGTCA ACGTGCTTGG 87 00 

GGTGTTCCAC TTCCTATCTT CTACGCTGAA GATGGTACAG CTATCATGGT AGCTGAAACT 87 6 0 

ATTGAACACG TAGCTCAACT TTTTGAAGAA TATGGTTCAA GCATTTGGTG GGAACGTGAT 882 0 

GCCAAAGACC TCTTGCCAGA AGGATTTACT CATCCAGGTT CACCAAACGG CG AGTT C AAA 8880 

AAAGAAACTG ATATCATGGA CGTTTGGTTT GACTCAGGTT CATCATGGAA TGGAGTGGTG 8 94 0 

GTAAACCGTC CTGAATTGAC TTACCCAGCC GACCTTTACC T AG AAGGTT C TGACCAATAC 9000 

CGTGGTTGGT TTAACTCATC ACTTATCACA TCTGTTGCCA AC C ATGGC GT AGCACCTTAC 9060 
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AAACAAATCT TGTCACAAGG TTTTGCCCTT GATGGTAAAG GTGAGAAGAT GTCTAAATCT 912 0 

CTTGGAAATA CTATTGCTCC AAGCGATGTT GAAAAACAAT TCGGTGCTGA AATCTTGCGT 9180 

CTCTGGGTAA CAAGTGTTGA CTCAAGCAAT GACGTGCGTA TCTCTATGGA TATCTTGAGC 92 4 0 

CAAGTTTCTG AAACTTACCG TAAGATTCGT AACACTCTTC GTTTCTTGAT TGCCAATACA 9300 

TCTGACTTTA ACCCAGCTCA AG AT AC AGT C GCTTACGATG AGCTTCGTTC AGTTGATAAG 9 3 60 

TACATGACGA TTCGCTTTAA CCAGCTTGTC AAG AC C ATT C GTGATGCCTA TGCAGACTTT 9420 

GAATTCTTGA CGATCTACAA GGCCTTGGTG AACTTTATCA ACGTTGACTT GTCAGCCTTC 9480 

TACCTTGATT TTGCCAAAGA TGTTGTTTAC ATTGAAGGTG CCAAATCACT GGAACGCCGT 9540 

CAAATGCAGA CTGTCTTCTA TG AC ATT CT T GTCAAAATCA CCAAACTCTT GACACCAATC 9600 

CTTCCTCACA CTGCGGAAGA AATCTGGTCA TATCTTGAGT TTGAAACAGA AGACTTCGTC 96 6 0 

CAATTGTCAG AATT AC CAG A AGTTCAAACT TTTGCTAACC AAGAAGAAAT CTTGGATACA 972 0 

TGGGCAGCCT TCATGGACTT TCGTGGACAA GCACAAAAAG CCTTGGAAGA AGCTCGTAAT 97 80 

GCAAAAGTTA TCGGTAAATC ACTTGAAGCA CACTTGACAG TTTATCCAAA TGAAGTTGTG 984 0 

AAAACT CT AC TCGAAGCAGT AAACAGCAAT GTAGCACAAC TTTTGATCGT GTCTGAGTTG 9900 

ACCATCGCAG AAGGACCAGC TCCGGAAGCT GCCCTTAGCT T CG AAG AT GT AGCCTTCACA 9960 

GTTGAACGTG CTACTGGTGA AGTATGTGAC CGTTGCCGTC GTATCGACCC AACAACAGCA 10020 

GAACGCAGCT ACCAGGCAGT TATCTGTGAC CACTGTGCAA GCATCGTAGA AGAAAACTTT 10080 

GCGGAAGCAG TCGCAGAAGG ATTTGAAGAG AAAT AAG AT T GAAAAGTCTA GGCAAAATTC 1014 0 

AATTTGAGAA GAAAAGACAA CTAATTTTAT AGTCTATTAA ACGCATTGTA TCACGTTTTT 1020 0 

GAATACCTGA TATGATGCGT TTTTTATTTA TTTTAAAAAT TTGCGAGGTA TGACTTTTTA 10260 

TACTCAACAA GAATCAAAGA GAAACTTAGC AAGCTAACAG TAGTAAGATA AAATAGGAAT 10320 

TTGATATTAG GG AT AAG AT T GGT AAAT AGT GTAATATTTT TACAACAATA AATTTATATA 103 80 

GTTATTTCTG GTTTCTGAAA AG T ATT AT AT TTTATTTCAT AT TAT AC AAA TTTTTATTTT 1044 0 

ATAATATCAG AACATACTTT TTTTAAAAGC AAATATGATA CAATTTTATT TGAAAAAAAT 10500 

AAAAAAGGAG ATTTTATTAT AAAATTAAAA AGACTTGCTT TAATTAGTGG TATCGTCGGT 105 60 

CTTGTGGGAG GAATTTTACT TCTTATTGGT CCTTTTGTCT TGTTGGGAAT AGCGGTAAAC 10620 

ACAGCTGCTA CAACTCTTAA TGGAGGAGCT ACTGCAGGGG CTTTTTCAGG TGTAGCCTTA 10680 

CTCTTGAATG CCTTGAAGAT TGCAAATCTT GTTCTTGGTA TCATTGCTAT TGTTTACTAT 1074 0 

AAAGGAGATA AGCGTGTAGG TGCAGCTCCG TCTGTACTAA TGATTGTTTC TGGTGGAGTT 10800 

AGTCTCATTC TATTCCGTTC TTAGGATGGG TTGGGGGGAT TTTTGCTATT AT CGG AGGAT 108 60 
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CTCTATTCCT TTCAACATTG AAGAAATTCA AATCAGAAGA ATAAAAGGTA TTTTAGCATG 10920 

AAAAGAACAA AAAAGTTTAT CGGTATAGGA GTAGCTCTAT TATCTCTTTC TCTTCTAGTT 109 8 0 

GCATGTGGAA CATAAAGTTC AAAGAATACT TCAACAAGTA ATGATGAGAA GACAGTAGCA 11040 

ACATCCAATA GTT C AAAAG A AACAATCACT TTCGATACAC CGGTTGTAAC AGACGATGCG 11100 

ATTGAATCAA TACGCACTTA T G C AG ATT AT AT AG AT CTTT ATAAAAATAT TTTTGATGAT 11160 

TATTTTACTA AAGCTGAGGA AGGTTTCAAA GGCATAGCTA TGGAAAATAA TGACTCGTTT 11220 

ACTAAACTAA AAGAGTCAAC TCAAAAATTA TTCGATGCGC AGAAAAAAAG GTTAAATAAT 112 80 

GAAGATAGAA TAGAAACAAC CAAAAACAAT GTGATTGCCA AACATTGTCA AACAGTCCTT 11340 

TCCTTTTTGG TTTTGACTAG CTTTTTTGTG AAAAATTGTG TAAAATAGAA T AG AT AAAC G 11400 

AGGGGAAACC TCGGAAAATT TAAAGGAGAA TCCATCTAAT GGTAAAATTG GTTTTTGCTC 114 60 

GCCACGGTGA GTCTGAATGG AACAAAGCTA ACCTTTTCAC TGGTTGGGCT GATGTTGATT 1152 0 

TGTCTGAAAA AGGTACACAA CAAGCGATTG ACGCTGGTAA ATTGATCAAA GAAGCTGGTA 11580 

TCGAATTTGA CCAAGCTTAC ACTTCAGTAT TGAAACGTGC TATCAAAACA ACTAACTTGG 11640 

CTCTTGAAGC TTCTGACCAA TTGTGGGTTC CAGTTGAAAA ATCATGGCGC TTGAACGAAC 117 00 

GTCACTACGG TGGTTTGACT GGTAAAAACA AAGCTGAAGC TGCTGAACAA TTTGGTGATG 117 60 

AGCAAGTTCA CATCTGGCGT CGTTCATACG ATGTATTGCC TCCAAACATG GACCGTGATG 1182 0 

ATGAGCACTC AGCTCACACA GACCGTCGTT ACGCTTCACT TGACGACTCA GTTATCCCAG 11880 

ATGCTGAAAA CTTGAAAGTG ACTTTGGAAC GTGCTCTTCC ATTCTGGGAA GATAAAATCG 11940 

CTCCAGCTCT TAAAGATGGT AAAAACGTAT TCGTAGGAGC TCACGGTAAC TCAATCCGTG 12 0 00 

CCCTTGTAAA AC AC AT C AAA GGTTTGTCAG ATGACGAGAT CATGGACGTG GAAATCCCTA 12 060 

ACTTCCCACC AT TGGT ATT C GAATTCGACG AAAAATTGAA CGTCGTTTCT G AAT ACT AC C 1212 0 

TTGGAAAATA AAAAATTGTA AGT C TAG AAT TGATTTCTAG GCTTTTTATG TTAGTATGGA 12180 

AGTATGATAA GGAATAAAAA ACAAGATTAT GTACTGGCCT ACAAGCAACC AGCTTCAACC 1224 0 

ACTTACATGG GTTGGGAAGA AGAAGCTTTA CCGATAGGCA ATGGTTCTTT AGGAGCAAAA 12 300 

GTATTTGGCC TTATAGGGGC TGAACGGATT CAATTTAATG AAAAAAGTCT CTGGTCTGGA 12 3 60 

GGTCCACTTC CTGATAGTTC AGATTATCAG GGTGGAAATC TTCAGGATCA GTATGTTTTT 1242 0 

TTAGCTGAGA TTCGGCAGGC TTTGGAGAAG AGAGATTACA ATCTGGCTAA GGAACTGGCT 124 80 

GAGCAGCACC TAATTGGGCC AAAAACGAGT CAATATGGGA CCTATCTGTC TTTTGGGGAT 12 54 0 

ATTCACATTG AGTTCAGCCA GCAAGGTACG ACTTTGTCTC AGGTGACGGA CTATCAGAGA 12600 



WO 98/18931 



PCT/US97/19588 



366 



CAGCTGAATA 


TTAGTAAGGC 


ACTTGCGACG 


ACTTCTTATG 


TCTATAAGGG 


AACGCGATTT 


12660 


GAACGTAAAG 


CTTTTGCGAG 


TTTTCCAGAT 


GATCTCTTGG 


TTCAATGTTT 


TACTAAGGAA 


12720 


GGGTTGGAAA 


CTCTAGATTT 


TACTATAGAA 


CTATCCTTGA 


CCTGTGATTT 


GGCTTCTGAT 


12780 


GGAAAGTATG 


AGCAGGAAAA 


ATCTGATTAC 


AAGGAGTGTA 


AGTTGGATAT 


TACTGATTCT 


12840 


CATATCTTGA 


TGAAGGGAAG 


AGTTAAGGAT 


AATGATCTGC 


GGTTTGCTAG 


TT AT C TAG C T 


12900 


TGGGAAACGG 


ATGGAGATAT 


TAGAGTTTGG 


TCAGATAGGG 


TT C AG AT AT C 


AGGAGCCAGT 


12960 


TATGCCAATC 


TCTTCTTGGC 


CGCTAAGACG 


GATTTTGCCC 


AAAATCCTGC 


TAGCAATTAT 


13020 


CGCAAGAAAC 


TAGATTTAGA 


GCAACAGGTG 


ATAGACTTGG 


TGGACACAGC 


TAAAGAAAAG 


13080 


GGCTATACCC 


AATTGAAATC 


AAGGCATATC 


GAGGACTACC 


AAGCCTTATT 


CCAGCGTGTT 


13140 


CAATTGGATT 


TGGAAGCTGA 


TGTTGACGCA 


T C C AC T AC AG 


ATGATTTGTT 


AAAAAATTAT 


13200 


AAGCCACAAG 


AAGGGCAGGC 


TTTGGAGGAG 


CTGTTCTTCC 


AGTATGGACG 


GT AT T TAT T G 


13260 


ATTAGTTCGT 


CC AG AG ACT G 


CCCAGATGCT 


CTACCAGCTA 


ACCTACAGGG 


AGTCTGGAAT 


13320 


GCGGTCGACA 


ATCCTCCTTG 


GAATTCGGAC 


TATCACTTAA 


ATGTCAATCT 


GCAGCTGAAT 


13380 


TATTGGCCAG 


CCTATGTTAC 


CAATCTCCTA 


GAGACGGTCT 


TTCCAGTCAT 


CAACTATGTA 


13440 


GATGATTTGC 


GTGTCTATGG 


TCGTCTAGCG 


GCTGTAAAGT 


ATGCAGGAAT 


CGTCTCTCAG 


13500 


AAAGGTGAGG 


AGAATGGTTG 


GTTGGTTCAT 


ACTCAAGCGA 


CTCCCTTTGG 


TTGGACGGCA 


13560 


CCTGGTTGGG 


AT TACT ATT G 


GGGTTGGTCA 


CCAGCTGCCA 


ATGCGTGGAT 


GATGCAAACC 


13620 


GTTTATGAAG 


CCTATTTATT 


TTATAGGGAC 


C AAG ACT AT C 


TCAGGGAGAA 


AATTTATCCC 


13680 


ATGTTGAGGG 


AAACGGTTCG 


TTTTTGGAAT 


GCCTTTTTAC 


ATAAGGATCA 


GCAGGCGCAG 


13740 


CGTTGGGTGT 


CTTCTCCGTC 


TTATTCCCCA 


GAACATGGGC 


CGATTTCGAT 


TGGCAATACC 


13800 


TATGACCAAT 


CTCTGATTTG 


GCAGTTATTT 


CATGATTTTA 


TTCAGGCTGC 


TCAGGAATTG 


13860 


GGACTGGATG 


AGGACTTGTT 


GACTGAGGTT 


AAGGAGAAGT 


CTGATTTACT 


AAATCCTTTG 


13920 


CAAATCACTC 


AATCTGGTCG 


AATCAGGGAG 


TGGTATGAGG 


AGGAAGAGCA 


GTATTTTCAA 


13980 


AATGAGAAAG 


TGGAGGCCCA 


GCATCGGCAC 


GCTTCCCATC 


T AGTGGG AC T 


CTATCCTGGC 


14040 


AATCTCTTTA 


GCTACAAGGG 


ACAAGAGTAT 


ATTGAAGCGG 


CGCGTGCTAG 


CCTCAATGAT 


14100 


CGTGGAGATG 


GCGGCACAGG 


CTGGTCCAAG 


GCTAATAAGA 


TCAATCTCTG 


GGCGCGTTTG 


14160 


GGAGATGGCA 


ATCGAGCCCA 


TAAATTATTG 


GCAGAGCAGT 


T AAAG AC AT C 


CACCTTGCAA 


14220 


AATCTTTGGT 


GTAGCCATCC 


TCCTTTTCAG 


ATAGATGGTA 


ATTTTGGTGC 


TACTAGTGGC 


14280 


ATGGCAGAAA 


TGTTACTCCA 


GTCTCATGCA 


GCTTATCTGG 


TACCTCTAGC 


TGCCCTACCT 


14340 


GATGCTTGGT 


CAACAGGTTC 


TGTTTCAGGC 


TTAATGGCAC 


GTGGACATTT 


TGAAGTGAGC 


14400 



WO 98/18931 



PCT/US97/19588 



367 



ATGAGCTGGG 


AAGATAAAAA 


ACTCTTACAG 


TTGACCATTT 


TATCAAGGAG 


TGGAGGAGAT 


14460 


TTGCGAGTTT 


CTTATCCAGA 


TATTGAGAAG 


AGTGTGATTA 


AAATGAATCA 


AGAAAAAATA 


14520 


AAAGCGAAAT 


GCATGGGGAA 


AGATTGTATT 


TCGGTGGCAA 


CAGCAGAAGG 


TGATCTTGTT 


14580 


CAATTTTATT 


TTTAAGAAGA 


TGTTATAAGG 


CAGTAATTTG 


AAACTGCCTT 


TTAATAAGGA 


14640 


TTTAAGAATA 


TAAGCAGTTT 


TCAACTAGTT 


GAAAAAACGT 


TATAATGATA 


ATAGGAAGTA 


14700 


ATACTCAATG 


AAAATCAAAG 


AGCACAAACT 


AGGAAGCTAG 


CCGCAGGTTG 


CTCAAAACAG 


14760 


TGTTTTGAGG 


TTGCAGATGG 


AAGCTGACGT 


GGTTTGAAGA 


GAGATTTTCG 


AGGAGTATAA 


14820 


TTTGTTTGAT 


AGAGGGTGGG 


TCTGATGGCT 


TATATTGAGA 


TGAAACACTG 


TTACAAGCGT 


14880 


TATCAGGTTG 


GGGACACGGA 


GATTGTGGCC 


AATTGTGATG 


TGAATTTTGA 


GATTGAAAAG 


14940 


GGGGAGCTGG 


TTATTATCCT 


TGGTGCTTCA 


GGTGCAGGCA 


AGTCAACAGT 


TCTTAACCTT 


15000 


CTTGGGGGAA 


TGGATACCAA 


TGATGAAGGG 


GAAATCTGGA 


TTGATGGTGT 


TAATATTGCG 


15060 


GAT TAT AG T T 


CCCACCAGCG 


CACCAATTAC 


CGTAGAAATG 


ATGTGGGGTT 


TGTTTTTCAG 


15120 


TTTTATAATC 


TAGTTTCTAA 


TCTGACAGCT 


AAGGAAAATG 


TGGAACTGGC 


TTCTGAAATT 


15180 


GTGACAGATG 


CCTTGAATCC 


TGATCAGGCC 


TTGACAGATG 


TAGGTCTGGC 


TCATCGTCTC 


15240 


AATAACTTTC 


CAGCCCAGCT 


TTCTGGAGGG 


GAGCAACAGC 


GAGTCTCCAT 


TGCACGCGCG 


15300 


GTAGCCAAAA 


ATCCTAAAAT 


TCTCCTTTGT 


GATGAACCGA 


CTGGAGCCTT 


GG ATT AT C AG 


15360 


ACGGGCAAGC 


AGGTTTTGAA 


AATTCTCCAA 


GACATGTCTC 


GTCAAAAGGG 


AGCGACGGTG 


15420 


ATCATCGTGA 


CTCATAATGG 


AGCTTTGGCG 


CCCATTGCTG 


ATCGCGTGAT 


TCAAATGCAC 


15480 


GATGCCAGTG 


TCAAGGATGT 


GGTGCTCAAC 


CAGCATCCTC 


AGG AT ATT G A 


CAGTTTGGAG 


15540 


TACTAGCATG 


ATCAAGCGAA 


AAACTTATTG 


GAAGGACTTA 


GTTCAGTCCT 


TCACAGGCTC 


15600 


CAAGGGGCGT 


TTTTTATCCA 


TCTTGATCCT 


GATGATGTTG 


GGATCTCTAG 


CCTTAGTAGG 


15660 


CCTCAAAGTA 


ACCAGTCCCA 


ACATGGAGGC 


GACAGCTAAT 


GCTTATTTAA 


CAACTGCTCA 


15720 


AACCTTGGAT 


TTGGCAGTCA 


TGTCTAACTA 


TGGCTTGGAT 


CAAGCAGACC 


AAGAAGAACT 


15780 


AAAACAGACG 


GAGGGCGCAG 


AGGTCGAGTT 


TGGCTATTTG 


ACAGATGTGA 


CTATGGATAA 


15840 


TGGGCAGGAT 


GCCATTCGGC 


TGTACTCCAA 


AC C AG AGCG A 


ATTTCAACCT 


TTCAGCTAAG 


15900 


AAAGGGACGA 


CTTCCTCAGT 


CAGACAAGGA 


AATCGCTTTG 


GCCACTCATT 


TGCAAGGCCA 


15960 


ATACAGCGTG 


GGACAGGAGA 


TTAGTTTTAA 


AGAAAAAGAA 


GAGGGTCATT 


CCTCTTTAAA 


16020 


AGACCATACT 


TATACCATTA 


CTGGTTTTGT 


GGATTCGGCT 


GAAATCCTCT 


CCCAGCGAGA 


16080 


TATGGGCTAC 


GCAGGAAGTG 


GAAGTGGGAC 


TCTGACAGCC 


TATGGGGTGA 


TTTTACCTAG 


16140 
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TCAATTTGAT CAGAAAGTCT AC AAT AT AG C TCGTTTGAAA TATCAAGATT TAGCGGGTTT 16200 

AAATGCCTTT TCATCAGCTT ATGAAGAAAA ATCCAAGCAA CATCAAGAAG AGCTTGAACA 162 60 

AATTTTATCA GATAATGGCA AGGTACGTCT GCAACTTTTG AAAAAAGAAG GACAAGAGTC 16320 

TCTAGACAAG GGGCAAGAGA CCCTTGACAA GGCTCAGACT AATTTGCAGG AAGGCAAGCG 16380 

TCGTTTAGCA GCTGCTCAAG CTCGTATACA GGCTCAAGAA AGTCAACTAG CCTTGTTTCC 16440 

TCAAGTTCAG AGAGAGCAGG CTAGTGCTCA ACTTACCCAA GCCAAGCAGG AATTGGGCAA 16500 

GGAAGAGGAC AAACTAAAGC AAGCTGAACA AAATCTAGCC CAAGAAAAGG AAAAATTAGA 165 60 

AAAACATCAG CAAGTCTTGG ATGATTTGGC GGAGCCAAGG TATCAGGTTT ATAATCGTCA 16 620 

G AC CAT G C C A GGTGGTCAGG GCTATCTTAT GTATAGCAAT GCTTCATCCA GTATTCGAGC 16680 

AGTGGGCAAT ATCTTTCCTG TGGTACTTTA TGCCGTAGCA GCCATGGTGA CCTTTACGAC 16740 

CATGACTCGC TTTGTAGACG AAGAGCGAAC TCATGCAGGG ATTTTTAAGG CCTTGGGTTA 16800 

TCGTAGTAAG GATATTATCG CCAAGTTTCT CCTTTATGGA CTAGTAGCTG GGACTGTCGG 168 60 

AACGGCTCTA GGTAGTATAC TTGGTCATTA TTTGCTAGCC AGTGTAATTT C AAGTGT C AT 1692 0 

TACAAAAGGC ATGGTGGTGG G AG AAACT C A G ATT C AG T TC TATTGGACCT ATAGCTTACT 16980 

AGCTTTTGTC TTGAGCTTGT TGGCGAGTGT GTTACCAGCC TATCTGGTGG CTTGGAGGGA 17 04 0 

ACTTCATGAC GAAGCAGCCC AGCTTCTACT TCCTAAACCT CCTGTCAAAG GAGCTAAAAT 17100 

CTTATTGGAG CGTATCGGTT TTATCTGGCG TCGTCTCAGT TTTACTCATA AGGTAACAGC 17160 

CCGCAACATC TTTCGTTATA AGCAGAGAAT GTTGATGACA ATCTTTGGTG TGGCAGGTTC 17220 

TGTAGCTCTG CTCTTTGCAG GTTTGGGAAT CCAATCTTCT GTAGCAGGAG TTCCGTCTAA 172 8 0 

ACAGTTTCAA CAAATCCAAC AGTATCAGAT GCTTGTCTCT GAAAATCCTA GTGCGACCAA 17 340 

TCAGGACAAG GTAGAGCTAG CAGAAGTGTT GAAAGGGCAG GAGATACTAG CCTACCAGAA 174 00 

AATCTATTCT AAAGCGCTAT ACAAGGATTT CAAAGGCAAA GCTGGTCTTC AAAAC AT T AC 174 6 0 

TCTTATGATG ATAGAGAAGG AAGATTTGAC TCCCTTTATC CATCTTCAAC ATCATCAGCA 17 520 

GGAGCTGACA TTAAAAGATG GCATCGTTAT T AC AG CT AAA CTCGCCCAGC TGGCAGGTGT L7 580 

CAAGGTTGGG CAGACTTTAG AAATTGAAGG TAAGGAACTA AAGGTCGTTG CT ATT AC T G A 17 640 

GAACTACGTT GGTCACTTTA TTTATATGAG TCAGGCTAGC TATGAGCAAC TTTACGGACA 17700 

GCTACCCCAA GCCAACACTT ATCTGGTCTC ATTAAGGGAT ACCAGTGCAA CTAGTATCGA 177 60 

AAGTCAGGCG GGCTTGCTTA TG AAT C AAT C TGCGGTGTCC AGCGTTGTCC AAAATGCTTC 17 82 0 

AGCCATTCGA CTCTTCGACT CTATCGCTAG CTCACTCAAT CAGACCATGA CCATCTTGGT 17880 

CATCGTATCG GTTCTATTAG CTATTGTCAT CCTTTACAAT CTGACCAATA TCAACGTAGC 17 940 
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TGAGAGAATC 


CGTGAACTCT 


CCACTATCAA 


GGTTCTTGGT 


TTTCATAATA 


ATGAAGTCAC 


18000 


CCTCTACATT 


TACCGTGAGA 


CGATTGTGCT 


GTCCCTTGTG 


GGAATCGTAC 


TTGGTCTGAT 


18060 


AGCTGGTTTC 


TATTTACACC 


AATTTTTGAT 


TCAAATGATT 


TCGCCTGCGA 


CTATTCTCTT 


18120 


TTATCCGCAG 


GTAGGCTGGG 


AAGTCTATGT 


AATCCCAGTG 


GCAGCAGTAA 


GCATCATTTT 


18180 


GACCTTGCTT 


GGTTTCTTCG 


TCAATTATTA 


TCTGAGAAAG 


GTTGATATGT 


TAGAAGCCCT 


18240 


GAAATCTGTA 


GAGTAAGGTA 


GTTATTTTTA 


GCTGATTGAA 


CTTCTATTTA 


CT AAT ATT C A 


18300 


AAAATCCTCC 


GTTTCAAAGA 


GCAGGGAACT 


CTTTGTGACA 


GAGGATTTTT 


TCTATAGGGC 


18360 


TTTAGCAGCT 


GCAATTGCGG 


CTTCGAAGTT 


TGGCTCAGAA 


TTG AT AT T AT 


CCACGTATTC 


18420 


AACGTAGCGA 


ATCGTATTGT 


CAGTATCGAG 


GACAAAGACT 


GCGCGTGCTA 


ATAGGTGCCA 


18480 


TTCGTTGATC 


AAGAGGGCAT 


AATCGCGCCC 


GAAAGAATGG 


TCAAAGTAGT 


CTGAAAGCAT 


18540 


AATGGCATTG 


TCAAGGCCTT 


CAGCACCGCA 


CCAACGTTTT 


TGAGCAAAAG 


GTAGGTCCAT 


18600 


TGAAACAGTC 


AATACGACCG 


TGTTGTCCAG 


TCCAGCCAAT 


TCTTCATTAA 


AACGACGTGT 


18660 


TTGAGTTGAG 


CAGATGCCTG 


TATCGATAGA 


AGGAACGACA 


CTCAAGACTT 


TTTTCTTGCC 


18720 


ATCAAAATCA 


GCCAGAGATT 


TTTTAGAAAG 


ATCTGTTGTA 


GTAAGAGAAA 


AATCAAGCGC 


18780 


CTTGTCGCCG 


ACTTGTAGTT 


GTTTACCTGT 


AAAGCTCACA 


GGATTTCCGA 


GAAAAGTTAC 


18840 


CATAGGATAC 


TCCAATCTTT 


TTTCTTCCAT 


TTTAGCTGAA 


ACAGTCGGAA 


TTTTCCAATG 


18900 


ATTTGACCGG 


AAATATGGGC 


ATAGAAAAAA 


CGCCAGCTCA 


TGTGAGAATG 


ACGTTTTTCA 


18960 


TAGGTTTATT 


TTGCCAATCC 


TTCAGCAATC 


TTGTCAAGGT 


TGTATTTCAT 


CAT G C TGT AG 


19020 


TAGCTGTCGC 


CTTCTTTACC 


TTGTTCTGCG 


AT AG AGT C AG 


TAAAGATTTG 


AGCGTAGATT 


19080 


GGGATGTTTG 


TGTCTTGAGA 


AACAGTTTTC 


ATTGGACGGT 


CATCCACACT 


TGATTCTACA 


19140 


AAGAGTGATG 


GAACTTTTGT 


TTGGCGAAGT 


TTTTCAACCA 


AGGTCTTGAT 


TTGTTCAGGA 


19200 


GTTCCTTCTT 


CTTCAGTATT 


GATTTCCCAG 


ATGTAAGCAC 


TTGGGACACC 


ATAGGCTTTA 


19260 


GAGAAGTATT 


TGAATGCTCC 


TTCGCTGGTT 


ACAATGAGTT 


TCTTTTCAGC 


AGGGATCTTA 


19320 


TTAAATTTAT 


CCTTACTTTC 


TTTATCAAGT 


TTGTCTAACT 


TATCAGTATA 


TTCTTTGAGA 


19380 


TTTTTTTCAT 


AGAATTCTTT 


ATTGTTAGGG 


TCTTTGGCGC 


TCAATTGTTT 


GGCGATATTT 


19440 


TTAGCAAAAA 


TAATACCGTT 


TTCAAGGTTA 


AGCCAAGCGT 


GTGGGTCTTC 


TTTTCCTTTT 


19500 


TCATTTTGAC 


CTTCAAGGTA 


G AT AAC AT C A 


ACGCCGTCGC 


TGACTGCGAA 


GT AGT CT T T G 


19560 


TTTTCAGTTT 


TCTTGGCATT 


TTCTACCAAT 


TTTGTAAACC 


AAGCATTGCC 


ACCTGTTTCA 


19620 


AGGTTGATAC 


CGTTATAGAA 


AAT C AAATT A 


GCCTCAGAAG 


TTTTCTTAAC 


GTCTTCAGGA 


19680 
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AGTGGTTCGT ATTCGTGTGG GTCTTGCCCA ATCGGAACGA TACT AT G AAG GTCAATTTTG 1974 0 

TCACCAGCAA TATTTTTAGT AATATCAGCG ATGATTGAGT TTGTAGCAAC AACTTTTAGT 19800 

TTTTGACCAG AAGTTGTATC TTTTTTTCCG CTAGCACATG CTACAAGAAT GATTGCAGAA 198 60 

AGAAAGAGAA CGAGTAATGT ACCTAATTTT TTCATTAGAT CCTCCAATTT ATTAGGGCTT 1992 0 

TGCCCCTTAT TTTAACAAAT GTTTATTTTT CAGTTTCAAA TATCGTTGTT TGGGAGCGAT 19980 

AAAGAAGCTA ATGAGAAAGA AACTAGCAGC TGTAAGCACG ATACTAGAAC CTGCCGCAAC 2 0 040 

ATTAAAACTA TAGCCAATAA AGAGTCCCAA AACTGAAGCA GTAGCTCCGA AGGTTGAGGA 20100 

AAGGAAAATC ATACTTTTCA GACTATTAGC ATACAGATAA GCAGTTGCAG CTGGGGTAAT 2 0160 

CAGCATGGCT ACAATCAGGA TAGTTCCGAC ACTTTGCATG GCTGTCACAG AC ACG AG AG T 2 0220 

CAGGAGTACC ATGAGAAGGT AGTGATAGAA ATTGACAGGC ATTCCCATGG CTTTAGCCAA 20280 

GAGTTCATCA AAGGAAGTTA TCAAGAGTTG CTTGAAGAAA AT C C AG ATT A ACAAGAGGAT 2 0340 

AGCTGCCCCC ACACCCATAG TAATAAACAT ATCCGTATCT TGGACGGCCA GG AT ATT AC C 2 0400 

AAAAAGGATA TGGAAAAGGT CAGTTGAACT TTTAGCGACA CCAATCAAGA TGATACCGAG 2 0460 

GGCTAAGAAA GAAGAAAAGG TAATGCCGAT GGCGGTATCG CTTTTGATAA TCGAGTTTCC 2 052 0 

TTTGATGTAG GTAATGATGA TGGCAGCTAG CAATCCAAAG ACAATGGCTC CGATAAAGAA 2 05 80 

GTCAAGGCCC AAGATGAAGG ATAGGGCTAC ACCTGGTAAG ACAGCATGTG AAATGGCATC 2 0640 

TCCCATGAGT GACATCCCGC GTAGAATAAT GAAACATCCC ACAGCTCCAG CTACAATCCC 2 07 00 

GACGACAATA GCTGTTATCA AGGCATTTTG TAGGAAATGG AATTTTTGCA ATCCATCGAT 2 07 60 

AAATTCTGCA ATCATAGGTC ACCTCCATTG AAAAAGAGTT G ATT AC CGT A AGCTTCTTTT 2082 0 

AGATTGGTTT CGGTAAAAGT TTCTTTTGTT GGACCAAAGG CAATCACTTC TCGATTGACA 2 0880 

AGTAAGACTT GATCGAAGTA GTGGGGAATC TTGCTGAGGT CGTGGTGAAC GATGAGAACC 2 094 0 

GTCTTCCCAG CTTTTTTCAA ATCTCTCAGC GTATTCATGA TGATTTCCTC ACTGACAGAG 21000 

TCAATCCCAG CAAAGGGTTC ATCCAAGAGG ATATAGTCGG CTTCCTGCAC CAAACATCTG 21060 

GCAATCAAGA CCCGCTGGAA TTGACCTCCA GACAGTTGAC TAATTTGACG TTCAGCGTAG 21120 

TCAGCTAGGC CGACGATTTC AAGGGCCTCT TGCACTTTCT TCCAATGTTT AGCCTTTAAA 21180 

CTTCGAAAGA GAGGAATAGA GGGAAATAGT CCTAACGAGA CGCATTCCTT GACCTTGATG 2124 0 

GGAAAGTTGT AGTCGATATT GATTTTTTGT TCGACATAGG CAATTCGGTG TAAGGATTTT 213 00 

TTAACTTCCT TGTCATCGAG AAATGCCTGA CCTTGATGTG GGATAATTCC CAACATACCT 213 60 

TTTAATAGTG TTGATTTCCC AG CGCCGTTT GGACCAATGA TGCCGGTAAT TGTTGGTCCA 2142 0 

TGGAGCACTA GTGAAATATC CTTAAGTGCC AACGTTTCTT TGTAGGAGAC ACTGAGGTTT 214 80 
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TCGATACGTA TCATAAACTT GTATTCCTCC TGTCTCTTAA TATACATTAA AAAAAAAATT 21540 

AAGTCAAGTT AATTTTTGAA AAAATTAAAA TAATAACTGA AAAATAGATT CTAAAGATAA 21600 

CTTTCAGGAT AAATTTCTAA ATTATAAAAC GCATAGTATC AAGTGTAAAA AACTTGGAAT 21660 

TATGCGTTTT AT C ATGG AAA GATTTTTTAT AATAGCTAAA AAATAA 21706 
(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6171 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

GATCCCCAGG AAAAAC CG AG GTTTTCCCAA TCAATCGTTA CTGTCATATT CCACTCCTTA 60 

TTCTAAAAAC CTATTTCTTA TATTCTACAC TATTTTTCTA AAATAGCAAG TATATTTTGT 12 0 

AATTTTCAGA AAATTTCTCC AAT AAAAAC C AACTCTTAGA ACTGATTCTT CATTTCACTT 180 

ATTTATCTTC AGTAACTACT TCCTGAAGAT AAGCGTCAAA AACTTCTTCA TCTGAAATCG 240 

TGTCAGAAAT GAAGCTTCCA TTGCTAGTGC GTTCTGACAA GTTCAAGTCT TGCAATCGGC 3 00 

TTTCATAGAT TGTTCCTTTA TTGGATTGGA CAAGCAGAGT TTGGTCGTTC ACATCCACTT 3 60 

CCGTACTGAA GAAATCGCCA ACAAATCCTT GCTCTGCAAC TGCTCCTGCC AAGAAGACAC 42 0 

GATGCGGTTT GTTTTTCAAC TCACGCAAGA CTTGTAATCC TCGTTTGGCA CGGCTGGTTG 4 80 

CTAGAATTTC CTCAATGGAA ACACGTTTCA AGCTTCCACG CTGGGTCAAG AGGTAGAAGG 540 

ACGAAGTATT ACAGATAAAG CCAGATTGGA GG AC AT CAT C TTCTTTCAAA TT C AT AG C CT 6 00 

TGACACCTGC TGCCTTAGCA CCGACAACCG GAACCTCTTC GATATTGAAA CGCAGGGCAT 6 60 

AACCATTTTG ACTAACCAAG ACAACATCAT CTAGTTTAAT CGGAGCCACT GCTACAATCT 72 0 

GATCTGTATC GTCTTTGAGC TTAGCATACT TGACAGACTT AGATCTATAG GTCCGCCATG 7 80 

GAGTGAATTC TTTTCGCTCT ACCCGTTTGA TTTGACCAAG GCGAGTCACT GCAAAGTAGG 84 0 

TTGTCGCATC GTCAAACTGA TCCAGTACTT C C AC AT AAAG GATTTCTTCA TTCGTTTCAA 900 

AGT TTGTG AT GGTTTGGCTC AGATGCTCTC CGATGTCCTT CCAACGAATA TCTGCCAACT 960 

CATGGATTGG TCTGTAGATG ACATTTCCAA GACTTGTGAA CATCAAGAGG TGCTGGGTTG 102 0 

TCTTGGCAGA TTGAACAAAA ATCAAACGGT CATC AT C ACG CTTGCCAATT TCTTCCAAGG 1080 

TGGAAGCCGC AAAGGAACGT GGACTGGTAC GCTTGATGTA ACCTGCCTTG GTCACGCTGA 1140 
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CGTAGGTATC 


TTCCTCAGCG 


ATAAGACTAG 


CTGTATCAAT 


CTCAATTGCT 


TTCGCAGTGT 


1200 


CTTCTAAAGA 


ACTCAAACGA 


GGAGTTGCAA 


ATTTCTTCTT 


GACCTCACGA 


AGTTCTTTCT 


1260 


TCATGAGATT 


GTACATAGTC 


CTTTCATCAC 


CGATAATAGC 


CGC CAGCAT A 


GCAATCTTCT 


1320 


CACGAAGCTC 


TGCTTCTTCT 


TCCTGCAAGA 


CAACCACATC 


GGTATTGGTC 


AAACGGTACA 


1380 


GTTGCAAAGT 


TACGATAGCC 


TCAGCCTGTT 


CTTCCGTAAA 


ATCATAGCTA 


ACTTTGAGGT 


1440 


TTTCCTTGGC 


GTCCGCCTTA 


TTCTCAGAAG 


CACGGATAAG 


AG C AAT G AC T 


TCATCCAAAA 


1500 


TCGAAATCAC 


ACG AAT C AAA 


CCTTCGACGA 


T ATGG AG AC G 


TTTCTCAGCC 


TTTTCTTTGT 


1560 


CAAAGCGTGA 


ACGCGCCAAA 


ATCACTTCTC 


GACGGTGAGC 


GATATAGCTA 


GACAGGATTG 


1620 


GAACAATCCC 


AACCTGACGA 


GGTGTGAAAT 


TGTCAATCGC 


CACCATATTA 


AAGTTGTAGT 


1680 


TGATTTGTAG 


GTCGGTGTAC 


TTAAATAAGT 


AGTTGAGAAC 


AAGCTCAGTA 


TTAGCGTCTT 


1740 


TCTTAAGTTC 


GATAGCGATA 


CGAAGACCAT 


CACGGTCAGA 


CTC AT C ACG A 


ACCTCAGCAA 


1800 


TCCCAGCTAC 


CTTGTTATTA 


ACACGAACAT 


CATCGATTTT 


CT T G ACT AG A 


TTGGCCTTAT 


1860 


TGATTTCATA 


AGGAATCTCA 


ATAATAACGA 


TTTGTTCCTT 


ACCACCTTTT 


AGCTTTTCAA 


1920 


TTTCAGTCTT 


GGAACGAACA 


ACCACGCGCC 


CTTTCCCAGT 


CTCATAAGCT 


TTCTTGATTT 


1980 


CATCACGACC 


C TGAAT AAT A 


GCCCCTGTAG 


GGAAGTCTGG 


TCCAGGCAAG 


AATTCCATGA 


2040 


GTTTATCAAT 


CTTTGCAGTT 


GGGTGGTCAA 


TCATGTAAAC 


TGCAGCATCT 


ATGACCTCAG 


2100 


CTAAATTATG 


GGGAGGAATG 


TCTGTGGCAT 


AACCAGCCGA 


AATCCCAGTC 


GAACCATTGA 


2160 


CCAAGAGGTT 


TGGAAAGGCT 


GCTGGCAAGA 


CCGTTGGTTC 


TTTCTCCGTA 


TCGTCAAAGT 


2220 


TCCATGCAAA 


AGG AACTGT C 


TTTTTCTCGA 


TATCCTGAAG 


AAGGTAGCCT 


GCAATTTCAG 


2280 


ACAAACGTGC 


CTCAGTATAA 


CGCATAGCCG 


C AGG AGG AT C 


TCCGTCCATA 


G AACCGT T AT 


2340 


TACCGTGCAT 


TTC AAC TAG A 


ATCTCACGAT 


TTTTCCAGTT 


CTGTGACATA 


C G AAC C ATGG 


2400 


CATCATAGAT 


AGAAGAATCC 


CCGTGTGGGT 


GGAAATTCCC 


CATGATGTTC 


CCGACTGACT 


2460 


TGGCCGACTT 


ACGGTAGCTC 


TTGTCAAAAG 


TATTGCTATC 


CTTATTCATA 


GAATAAAGAA 


2520 


TACGGCGCTG 


AACCGGCTTC 


AACCCATCAC 


GAATATCTGG 


CAAAGCCCGG 


TCTTGAATAA 


2 5 80 


TGTACTTGGA 


GTAGCGACCA 


AAGCGCTCTC 


CCATGATGTC 


CTCCAGGGAC 


ATGTTTTGAA 


2640 


TGTTAGACAT 


AAGATACAAA 


GCCCATAAAA 


TACCAAGTGA 


AAATAGAAAA 


TTCTTGAAGT 


2700 


AAGCAAACTC 


ACAAGAGAAT 


TTATCTTTTT 


CACACAGTAT 


CTAGGGCGTG 


TTCAACTCCT 


2760 


TTCAAAGAAT 


GTAGAGTAGG 


TTTTTATGCA 


GTAAAAGATA 


TTTTACGGGA 


ATTCCTCCCG 


2820 


TGTTCAGTTA 


CGATAAGTAA 


CCAAACTATC 


CTGTTTGTAT 


TTTTCAATAT 


GAAAATCTGG 


2880 


TTTTCCAAAA 


TTAGTCTTAG 


TTTGTGTCTT 


AGCCGCTCCC 


TTAAGCGCCT 


CTTTGAGATA 


2940 
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AGC ACT CAT A 


GCAGATTCTT 


CATTAATAAT 


CCTGCAATTT 


TTTCAAACCA 


AGATTTTCAA 


3000 


ACTGCTTTTT 


CACATAGTCA 


TTCACATCCG 


ACTCTAATTT 


CCAGTTTACT 


AACATATTAT 


3060 


TTTCTTTCAT 


T AAAAC AC TG 


TCGTTTCTTC 


TAGCGTAAAC 


TTGACATTAT 


CTTCAATCCA 


3120 


TTTACGGCGT 


GGTTCTACCT 


TATCTCCCAT 


G AG AAC AT T G 


ACGCGGCGTT 


CGGCGCGCGC 


3180 


TAAATCTTCA 


ATTGTGACAC 


GGATGAGGGT 


ACGTGTTTCT 


GGGTTCATGG 


TTGTTTCCCA 


3240 


GAGCTGGTCC 


GCATTCATCT 


CACCAAGTCC 


TTTGTATCGT 


TGGAGGGTAG 


CGCCTTTACC 


3300 


GAACTGTTTA 


CGGAGTTCTT 


CTAGTTCTCC 


GTCCGTCCAA 


GCGTAGGCCA 


CTTCTTCTTT 


3360 


CTTGCCTTTA 


CCTTTGGACA 


TCTTGTAAAG 


AGGTGGGAGG 


GCAATATAGA 


CATGACCTGC 


3420 


CTCGACTAGC 


GGACGCATGT 


AACGGTAGAA 


AAATGTCAAG 


AGCAAGGTCT 


GGATATGGGC 


3480 


ACCGTCGGTA 


TCCGCATCGG 


TCATGATAAT 


GATCTTATCA 


TAGTTGGCAT 


CTTCAATAGA 


3540 


GAAGTCTGCT 


CCAACACCCG 


CACCAATGGT 


ATAAATCATG 


GTATTGATCT 


CTTCATTTTT 


3600 


G AGG AT AT C C 


GCCATCTTGG 


CCTTGGCTGT 


ATTGACAACC 


TTACCACGAA 


GAGGTAGAAT 


3660 


AGCCTGGAAC 


TTGCGGTCAC 


GACCTTGTTT 


GGCAGAACCA 


CCGGCAGAGT 


CCCCCTCAAC 


3720 


TAGATAGAGT 


TCATTCTTAG 


CAGGATTCTT 


AGATTGGGCT 


GGGGTCAATT 


TCCCAGACAA 


3780 


CAAGCCCTTA 


TCTTTCTTGT 


TTTTCTTCCC 


ATTTCGGCTC 


TCATCACGCG 


CCTTACGTGC 


3840 


TGCTTCACGA 


GCATCACGGG 


CCTTGATAGC 


CTTGCGGATG 


AG GTT AG AAG 


CTAATTCCCC 


3900 


ATTTTCCATA 


AGGAAAAAGG 


TCAACTTATC 


AGCCACTATT 


CCATCCACAA 


CTGGGCGAGC 


3960 


TAGGGGGCTT 


CCTAGTTTAT 


CCTTGGTCTG 


TCCTTCAAAC 


TGCAAGTGTT 


CTTCAGGAAC 


4020 


TAAGATAGAA 


AGAACGGCCG 


CTAGTCCCTC 


ACGATAGTCT 


GAACCTTCAA 


GGTTTTTATC 


4080 


TTTTTCCTTG 


AGAAGACCTG 


TTTTACGTGC 


ATAGTCATTC 


ATGACCTTGG 


TAATGGCAGA 


4140 


CTTGAGTCCT 


GTCTCGTGCG 


TTCCACCGTC 


CTTGGTGCGA 


ACGTTATTGA 


CAAAAGATAG 


4200 


AATGTTATCT 


GAGAATCCGT 


CATTGTACTG 


GAGGGCTACT 


TCCACTTGAA 


AACCATTGTC 


4260 


TTCCCCTTCA 


AAGTAAAGAA 


CTGGCGTCAA 


GATTTCCTTA 


TCTTCGTTGA 


GATAAGAAAC 


4320 


AAAATCTTGT 


ACTCCATTCT 


CATAGTGGAA 


CTCAATCGCT 


TCATTTGTTC 


GCTTGTCCGT 


4380 


TAAAGACAAG 


GTCACATTTT 


TCAAGAGAAA 


GGCTGATTCA 


TTAAGGCGCT 


CTGAAATGGT 


4440 


ATTGTACTTG 


AAATCTGTCG 


TAGAAAATAT 


AGTCGCGTCA 


GGCATAAAAG 


TAACTTTGGT 


4500 


GCCTGTTTTA 


GACTTGGGTG 


CTGTACCGAT 


TTTCTTCAAA 


GTCGTGACAG 


GTTTTCCACC 


4560 


ATTTTCGAAA 


CGTTGCTTGT 


AAACTGCGCC 


ATCACGGGTA 


ATTTCAACTT 


CTAACCAGCT 


4620 


AGAAAGGGCG 


TTAACAACGG 


AAGAACCCAC 


TCCGTGAAGT 


CCACCTGATG 


TCTTATAGCC 


4680 
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AC CTTG AC CG AATTTCCCTC CGGCATGAAG AATGGTAAAG ATAACCTCAA CAGTTGGAAT 474 0 

TCCCATAGCG TGCATACcTG TCGGCATCCC ACGTCCATGG TCTTGAACCG TTAGACTACC 4 800 

GTCTTTATTG ATAGTTACAT CAATACGATC ACCAAACCCA GACAAGGCTT CAT CG AC TG C 4860 

ATTATCAACG ATTTCCCAAA CTAGGTGATG AAGACCAGCG CCATCGGTCG ATCCAATATA 4 920 

CATCCCTGGA CGTTTTCGGA CCGCATCCAA CCCTTCTAGC ACCTGAATAG CATCATCATT 4980 

ATAATTGTTA ATATTGATTT CCTTTTTTGA CACAAGGAAC CTCCTATTCG TTCATCTTTA 5040 

CTATTCTACA GGTTTTCCAA GGATTTTGCA AAATTTTTCT TTCTCCGATG TGACAATTTC 5100 

AGCAGAGATT CTCTGCTTTT CTTTCCCAAT TCATGATATA ATAGGAGTAT GATTACAATA 5160 

GTTTTATTAA TCCTAGCCTA TCTGCTGGGT TCGATTCCAT CTGGTCTCTG GATTGGACAA 52 2 0 

GTATTCTTTC AAATCAATCT ACGCGAGCAT GGTTCTGGTA ACACTGGAAC GACCAACACC 52 80 

TTCCGCATTT TAGGTAAGAA AGCTGGTATG GCAACCTTTG TGATTGACTT TTTCAAAGGA 53 4 0 

ACCCTAGCAA CGCTGCTTCC GATTATTTTT CATCTACAAG GCGTTTCTCC TCTCATCTTT 54 00 

GGACTTTTGG CTGTTATCGG CCATACCTTC CCTATCTTTG C AG GAT T T AA AGGTGGTAAG 54 60 

GCTGTCGCAA CCAGTGCTGG AGTGATTTTC GG AT T TG CG C CTATCTTCTG TCTCTACCTT 5 52 0 

GCGATTATCT TCTTTGGAGC TCTCTATCTT GGCAGTATGA TTTCACTGTC TAGTGTCACA 5580 

GCATCGATTG CGGCTGTTAT CGGGGTTCTG CTCTTTCCAC TTTTTGGTTT TATCCTGAGT 5640 

AACTATGACT CTCTCTTCAT CG C T ATT AT C TTAGCACTTG CTAGTTTGAT TATCATTCGT 5700 

CATAAGGACA ATATAGCTCG TAT C AAAAAT AAAACTGAAA ATTTGGTCCC TTGGGGATTG 57 60 

AACCTAACCC AT CAAGAT CC TAAAAAATAA AATGCCAGTT CTGTACTGCC CCCAAACAGT 5 820 

TAGACAAATA ATTTATCCAA AGGATTTAGT TCTGTACTGC ACAGGACTAA GTCCTTTTAG 58 80 

TTTTACCTTA ATTCGTTTGT TGTTGTAGTA ATCAATATAG TCTATAATGG CTTGTTCCAA 5940 

TTGATTAAGT GATTTAAATG TTTTCTCATA GCCATAAAAC ATTTCGGATT TTAAAATGCC 6000 

AAAGAAAGAT TCCATCCTAC CGTTGTCTTG GCTGTTGCCC TTACGTGACA TGGATGCTTG 60 60 

AATTCCCTTA CTCTCTAGGA ACCGATGATA AGAATCGTGT TGGTATTGCC AGCCTTGGTC 612 0 

AC T ATGGAG A ATCGTATTCT CGTAGTGCTT CTCTGTGAAT GCCTGTTCCA A 6171 
(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18475 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 38: 

TATTACAAAT AAAAAAACGG AGGAGTGCTT TATGAAAGCC TATACTTATG TTAAACCAGG 60 

ACTTGCTTCT TTTGTTGATG TAGACAAACC AGTTATTCGC AAGCCAACAG ACGCTATTGT 12 0 

GCGTATTGTA AAAACCACTA TTTGTGGAAC AGACCTCCAT ATTATCAAAG GGGATGTTCC 180 

TACTTGCCAA AGTGGTACCA TTCTTGGCCA CGAAGGGATT GGGATTGTTG AAGAAGTTGG 24 0 

GGAAGGAGTT TCCAACTTCA AAAAAGGTGA CAAGGTCTTG ATTTCTTGCG TCTGTGCCTG 300 

TGGTAAATGC TACTACTGTA AAAAAGGAAT TTATGCTCAC TGTGAAGACG AAGGGGGCTG 3 60 

GATTTTCGGT CACTTGATTG ATGGTATGCA GGCTGAATAT CTACGTGTCC CTCATGCAGA 42 0 

TAATACTCTT T ACC AT ACT C CAGAAGACTT GTCAGATGAA GCTTTGGTTA TGCTGTCAGA 4 80 

CATTCTGCCT ACTGGATATG AAATTGGTGT CTTAAAAGGG AAAGTAGAAC CTGGTTGCAG 54 0 

CGTAGCCATT ATTGGTTCAG GTCCAGTTGG ATTGGCTGCT CTTTTAACAG CCCAATTCTA 60 0 

TTCACCAGCT AAATTGATTA TGGTAGACCT AGACGATAAC CGCTTGGAAA CTGCCCTATC 6 60 

ATTCGGTGCG ACTCATAAGG TTAATTCTTC AGACCCTGAA AAAGCCATTA AAGAAATTTA 72 0 

TGATTTGACA GATGGTCGTG GTGTGGATGT CGCTATCGAA GCTGTTGGTA TTCCTGCAAC 7 80 

ATTTGATTTC TGTCAAAAGA TTATCGGTGT AGACGGAACG GTTGCCAACT GTGGTGTGCA 84 0 

TGGTAAACCA GTTGAATTCG AT T T AG AT AA ACTTTGGATT CGCAACATCA ATGTAACAAC 900 

TGGTTTGGTA TCTACAAATA CGACTCCACA ATTGTTGAAA GCACTTGAAA GTCATAAGAT 9 60 

TGAACCGGAA AAATTGGTAA CTCACTATTT CAAACTCAGT GAAATTGAAA AAGCCT AC G A 102 0 

AGTCTTCAGT AAGGCAGCAG ACCACCATGC CATTAAGGTC ATT AT CG AAA ACGATATCTC 108 0 

AGAAGCCTAA GTAGTAAAAA TATTTTTGTA CATAAGTAAA TAGAAATTCA GT C AT C CAT C 114 0 

AGATGGCTGG ATTTTTTATC AAAAAATTAA GAAATGAGCA TATTTCTTTC CTTGTCTGGC 12 00 

GGAATTGGTT ATAATATACG GTACAAAGGA ATGAATGAAT ATGTATCGTG TTATAGAAAT 12 60 

GTACGGAGAT TTTGAACCGT GGTGGTTCTT AGAAGGTTGG GAAGAAGATA TTGTAGCAAG 13 2 0 

TAGAAAATTT GACCAGTATT ATGATGCTCT C AAAT AC T AC AAAACTTGCT GGTTTAGATT 13 80 

GGAACAAGAA TCGCCTCTTT ATAAAAGTAG AAGCGACTTG ATGACCATTT TTTGGGACCC 1440 

GGAAGACCAA CGCTGGTGTG ATGAATGT G A TGAGTATTTA CAACAATACC ATTCTTTGGC 1500 

TCTTTTGCAG GATGAGCAGG TTATCCCAGA CGAAAAACTA CGCTCAGGCT ATGAAAAACA 1560 

AACCAGTCAG GAAAGGAATC GTTCTTGCCG TATGAAATTA AAATAGAGAA AAGTAACTTT 162 0 

TTTGGAGTTG CTTTTTTTAT TTTTCTAACT CTTTGCGAAT AGTATAGGTG AGGAGGTAAG 1680 
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TATGGTTCAA GAAATTGCAC AAGAAATCAT TCGTTCAGCT CGGAAAAAAG GGACGCAGGA 17 40 

TATCTATTTT GTCCCTAAGT TAGACGCCTA TGAGCTTCAT ATGAGGGTAG GAGACGAGCG 1800 

CTGTAAAATT GGTAGCTATG ATTTTGAAAA GTTTGCAGCC GTTATCAGTC ACTTTAAGTT 18 60 

TGTGGCGGGT ATGAATGTGG GAGAAAAAAG ACGTAGTCAA CTGGGTTCCT GTGATTATGC 1920 

CTATGACCAT AAGATAGCGT CTCTACGTTT ATCTACTGTA GGCGATTATC GGGGGCATGA 1980 

GAGTTTGGTT ATCCGTTTGT TGCACGATGA GGAGCAGGAC CTGCATTTTT GGTTTCAGGA 204 0 

TATTGAAGAA TTAGGCAAGC AGTACAGGCA ACGGGGACTC TATCTTTTTG CTGGTCCGGT 2100 

TGGGAGTGGT AAGACGACCT TGATGCATGA ATTGTCCAAG TCACTCTTTA AAGGACAGCA 2160 

AGTTATGTCC ATCGAAGATC CTGTCGAAAT CAAGCAGGAC G AC AT GC T T C AGTTGCAGTT 222 0 

GAACGAAGCA ATCGGCCTAA CCTATGAAAA TCTAATCAAA CTTTCCTTGC GTCATCGACC 2280 

AGATCTCTTG AT T AT CGG AG AAATTCGTGA CAGCGAGACG GCGCGTGCAG TGGTCAGAGC 2340 

TAGTTTGACA GGTGCGACAG TCTTTTCAAC CATTCACGCC AAGAGTATCC GAGGTGTTTA 2400 

TGAGCGTCTG CTGGAGTTGG GTGTGAGTGA AGAAGAATTG GCAGTTGTTC TGCAAGGAGT 2 4 60 

CTGCTACCAG AGATTAATCG GGGGAGGAGG AATCGTTGAC TTTGCAAGCA GAG AT T ATC A 2 52 0 

AG AAC AC C AA GCAGCCAAGT GGAATGAGCA AATT G AC C AG CTTCTTAAAG ATGGACATAT 2 580 

CACAAGTCTT CAGGCTGAGA CGGAAAAAAT TAGCTACAGC TAAGCAAAAA AATATCATCA 2 64 0 

CCCTATTTAA CAATCTCTTT TCTAGCGGTT TTCATCTGGT GGAGACTATC TCCTTTTTAG 2 7 00 

ATAGGAGTGC TTTGTTGGAC AAGCAGTGTG TG AC C C AG AT GCGTGTGGGC TTGTCTCAGG 2760 

GGAAATCATT CTCAGAAATG ATGGAAAGTT TGGGATGTTC AAGTGCTATT GTCACTCAGT 2 82 0 

TATCCCTAGC TGAAGTTCAT GGCAATCTCC ACCTGAGTTT GGGAAAGATA GAAGAATATC 2 880 

TGGACAATCT GGCTAAGGTC AAGAAAAAAT TGATTGAAGT AGCGACCTAT CCCTTGATTT 2940 

TGCTGGGTTT TCTTCTCTTA ATTATGCTGG GGCTACGGAA TTACCTGCTC CCACAACTGG 3 000 

ATAGTAGCAA TATTGCCACC CAAATTATCG GTAATCTGCC CCAAATTTTT CTAGGCATGG 3 0 60 

TAGGGCTTGT TTCCGTGCTT GCCCTTTTAG CACTCACTTT T T AT AAAAG A AGTTCTAAGA 312 0 

TGAGTGTCTT TTCTATCTTA GCACGCCTTC CCTTTATTGG AATCTTTGTG CAGACCTACT 3180 

TGACAGCCTA TTATGCACGT GAATGGGGGA ATATGATTTC ACAGGGAATG GAGTTGACGC 3 240 

AGATTTTTCA AATGATGCAG GAACAAGGTT CCCAGCTCTT TAAAGAAGTC GGTCAAGATC 3300 

TGGCTCAAAC CCTGAAAAAT GGCCGTGAAT TTTCTCAGAC G AT AGG AAC C TATCCTTTCT 3 3 60 

TTAGGAAGGA ATTGAGTCTC AT CAT AG AGT ATGGGGAAGT TAAGTCCAAG CTGGGTAGTG 3420 

AGTTGGAAAT CTATGCTGAA AAAACTTGGG AAGCCTTTTT TACCCGAGTC AACCGCACCA 3480 
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TGAATTTGGT GCAGCCACTG GTTTTTATCT TTGTGGCACT GAT T AT CGTT TTACTTTATG 3 54 0 

CGGCAATGCT CATGCCCATG TATCAAAATA TGGAGGTAAA TTTTTAAAAT GAAAAAAATG 3 600 

ATGACATTCT TGAAAAAAGC TAAGGTTAAA GCTTTTACAT TGGTGGAGAT GTTGGTGGTC 3 6 60 

TTGCTGATTA TCAGCGTGCT TTTCTTGCTC TTTGTACCTA ATCTGACCAA GCAAAAAGAA 3 72 0 

GCAGTCAATG ACAAAGGAAA AGCAGCTGTT GTTAAGGTGG TGGAAAGCCA GGCAGAACTT 37 80 

TATAGCTTAG AAAAGAATGA AGATGCTAGC CTAAGAAAGT TACAAGCAGA TGGACGCATC 3 84 0 

ACGGAAGAAC AGGCTAAAGC TTATAAAGAA TACAATGATA AAAATGGAGG AGCAAATCGT 3900 

AAAGTCAATG ATTAAGGCCT TTACCATGCT GGAAAGTCTC TTGGTTTTGG GACTTGTGAG 3 9 60 

TATCCTTGCC TTGGGCTTGT CCGGCTCTGT CCAGTCCACT TTTTCAGCGG TAGAGGAACA 4 02 0 

GATTTTCTTT ATGGAGTTTG AAGAACTCTA TCGGGAAACC CAAAAACGCA GTGTAGCCAG 4 080 

TCAGCAAAAG ACTAGTCTGA ACTTAGATGG GCAGACGCTT AGCAATGGCA GTCAAAAGTT 414 0 

GCCAGTCCCT AAAGGAATTC AGGCCCCATC AGGCCAAAGT ATTACATTTG ACCGAGCTGG 42 00 

GGGCAATTCG TCCCTGGCTA AGGTTGAATT TCAGACCAGT AAAGGAGCGA TTCGCTATCA 42 60 

ATTATATCTA GGAAATGGAA AAATTAAACG CATTAAGGAA ACAAAAAATT AGGGCAGTGA 43 2 0 

TTTTACTGGA AGCAGTAGTC GCTCTAGCTA TCTTTGCCAG CATTGCGACC CTCCTTTTGG 43 80 

G AC AAATT C A AAAAAATAGG CAAGAGGAAG CAAAAATCTT GCAAAAGGAA GAAGTCTTGA 4440 

GGGTAGCTAA GATGGCCCTG CAGACGGGGC AAAATCAGGT AAGCATCAAC GGAGTTGAGA 4 500 

TTCAGGTATT TTCTAGTGAA AAAGGATTGG AGGTCTACCA TGGTTCAGAA CAGTTGTTGG 4 5 60 

CAATCAAAGA GCCATAAGGT CAAGGCTTTT ACCTTGTTAG AATCCCTGCT TGCCCTCATT 4620 

GTCATCAGTG GGGGATTACT CCTTTTTCAA GCTATGAGTC AGCTCCTCAT TTCAGAAGTT 4 680 

CGCTACCAGC AACAAAGCGA GCAAAAGGAG TGGCTCTTGT TTGTGGACCA ACTTGAGGTA 4740 

GAAT TAG AC C GTTCGCAGTT CGAAAAAGTA GAAGGCAATC GCCTATACAT GAAGCAAGAT 4 800 

GGCAAGGACA TCGCCATCGG TAAGTCAAAG TCAGATGATT TCCGTAAAAC GAATGCTCGT 4860 

GGTCGAGGTT ATCAGCCTAT GGTTTATGGA CTCAAATCTG TACGGATTAC AGAGGACAAT 4 92 0 

CAACTGGTTC GCTTTCATTT CCAGTTCCAA AAAGGCTTAG AAAGGGAGTT CATCTATCGT 49 80 

GTGGAAAAAG AAAAAAGTTA AGGCAGGTGT TCTCCTCTAC GCAGTCACCA TAGCAGCCAT 5040 

CTTTAGTCTT TTGTTGCAAT TTTATTTGAA CCGACAAGTC GCCCACTATC AAGACTATGC 5100 

TTTGAATAAA GAAAAATTGG TTGCTTTTGC TATGGCTAAA CGAACCAAAG ATAAGGTTGA 5160 

GCAAGAAAGT GGGGAACAGT TTTTTAATCT AGGTCAGGTA AGCT AT C AAA ACAAGAAAAC 5220 
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TGGCTTAGTG ACGAGGGTTC GTACGGATAA GAGCCAATAT GAGTTTCTGT TTCCTTCAGT 52 80 

CAAAATCAAA GAAGAGAAAA GAGATAAAAA GGAAGAGGTA GCGACCGATT CAAGCGAAAA 5340 

AGTGGAGAAG AAAAAATCAG AAGAGAAGCC TGAAAAGAAA GAGAATTCAT AGTCAATTCA 54 00 

ACT AT AATG C GTTGAATCCA GAATAGTCCA CTGTAGTTTC TAGAAAATTG CTGGAAATGG 54 60 

ATGTTAAGCT CCAATTCATT TGTTTATATC TTATTTCAGT TTACTATACT TTGTGCTAAA 552 0 

TTAAAGATAT G AAAC AT GAT TTTAACCACA AAGCAGAAAC TTTCGATTCC CCTAAAAATA 55 8 0 

TCTTCCTCGC AAACTTGGTA TGTCAAGCAG CCGAGAAACA G ATT G AT CT T CTATCAGACA 5 640 

AAGAAATTTT AGATTTCGGT GGTGGCACGG GT C TAT TAG C CTTGCCCCTA ACCCCTAGCC 57 00 

AAGCAGGCTA AGTCAGTCAC TCTTGTAGAC ATTTCTGAGA AAATGTTGGA GCAAGCTCGT 57 6 0 

TTGAAAGTGG AGCAGCAAGC AATCAAGAAT ATCCAGTTTT TGGAGCAAGA TTTACCGAAA 5820 

AATCCCTTGG AGAAAGAGTT TGATTGCCTT GCTGTTAGTC GGGTTCTTCA TCATATGCCT 588 0 

GATTTGGATG CGGCTCTCTC ACTGTTTCAT CAACATTTGA AGGAAGATGG GAAACTCATC 5940 

ATTGCTGATT TTACCAAGAC AGAAGCTAAT CATCATGGAT TTGATTTAGC TGAACTGGAA 6000 

AACAAGCTAA TTGAGCATGG TTTTTCATCT GTGCATAGTC AGATTCTCTA TAGTGCTGAA 6060 

GACCTGTTTC AAGGAAATCA CTCAGAATTC TTTTTAATAG TAG C C C AAAA ATCACTCGCC 612 0 

TAGTCAGGGA GTGATTTTTC TATAAGGATG GAAAAAAGAA GGGAAATTTG GTAAGATAGG 6180 

AATATGGATT TTGAAAAAAT TGAACAAGCT TATACCTATT TACTAGAGAA TGTCCAAGTC 62 40 

ATCCAAAGTG ATTTGGCGAC CAACTTTTAT GACGCCTTGG TGGAGCAAAA TAGCATCTAT 63 00 

CTGGATGGTG AAACTGAGCT AAACCAGGTC AAGGAGAACA ATCAAACCCT TAAGCGTTTA 63 60 

GCACTACGCA AAGAAGAATG GCTCAAGACC TACCAGTTTC TCTTGATGAA GGCTGGGCAA 642 0 

ACAGAACCCT TGCAGGCCAA TCACCAGTTT ACACCGGATG CTATTGCTTT GCTTTTGGTG 64 80 

TTTATTGTGG AAGAGTTGTT TAAAGAGGAG G AAAT TACT A TCCTCGAAAT GGGTTCTGGG 654 0 

ATGGGAATTC TAGGCGCTAT TTTCTTGACC TCGCTTACTA AAAAGGTGGA TTACTTGGGA 6600 

ATGGAAGTGG ATGATTTGCT GATTGATCTG GCAGCTAGCA TGGCAGATGT AATTGGTTTG 6 660 

CAGGCTGGCT TTGTCCAAGG AGATGCCGTT CGCCCACAAA TGCTCAAAGA AAGCGATGTG 6720 

GTCATCAGTG ACTTGCCTGT CGGCTATTAT CCTGATGATG CCGTTGCGTC GCGCCATCAA 67 80 

GTTGCTTCTA GCCAAGAACA TACTTACGCC CATC AC T TGC TCATGGAACA AGGGCTTAAG 6840 

TACCTCAAGT CAGACGGATA CGCTATTTTT CTAGCTCCGA GTGATTTGTT GACCAGTCCT 6900 

CAAAGTGATT TGTTAAAAGA ATGGCTGAAA GAAGAGGCGA GTCTGGTTGC TATGATTAGT 69 60 

CTGCCTGAAA ATCTCTTTGC TAATGCCAAA CAATCTAAGA CTATTTTTAT CTTACAGAAG 7 020 
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AAAAATGAAA 


TAGCAGTAGA 


GCCTTTTGTT 


TATCCACTTG 


CTAGCTTGCA 


AGATGCAAGT 


7080 


GTTTTAATGA 


AATTTAAAGA 


AAATTTTCAA 


AAATGGACTC 


AAGGTACTGA 


AATATAAAAT 


7140 


AGATTTTGTT 


ATAATAGTTG 


AAAACGCTTA 


AAAAGGGGTA 


TCATGTTATG 


ACAAAAACAA 


7200 


TTGCAATCAA 


TGCAGGAAGT 


TCAAGTTTGA 


AATGGCAATT 


ATACTTAATG 


C C AG AAG AAA 


7260 


AAGTATTGGC 


GAAAGGTTTG 


ATTGAACGTA 


TCGGTTTGAA 


AGATTCAATT 


TCAACTGTAA 


7320 


AATTTGACGG 


CCGTTCTGAA 


CAACAAATTT 


TGGATATTGA 


AAAT C AT AT A 


CAAGCCGTTA 


7380 


AAATTTTATT 


GGATGACTTG 


ATTCGTTTCG 


ATATTATCAA 


GGCTTATGAC 


GAGATTACAG 


7440 


GTGTTGGACA 


TCGTGTTGTT 


GCTGGTGGAG 


AATATTTCAA 


AGAATCAACA 


GTTGTTGAGG 


7500 


GAGATGTTTT 


AG AAAAAG TT 


GAAGAGTTGA 


GTTTGTTGGC 


TCCTCTACAC 


AACCCGGCCA 


7560 


ATGCAGCAGG 


TGTTCGTGCC 


TTCAAGGAAT 


TGTTGCCAGA 


CATTACCAGT 


GTAGTTGTTT 


7620 


TTGATACTTC 


CTTCCACACA 


AGTATGCCAG 


AGAAAGCTTA 


TCGCTACCCT 


CTACCAACAA 


7680 


AATATTACAC 


AGAAAACAAG 


GTTCGTAAAT 


ACGGTGCTCA 


TGGTACAAGT 


CACCAGTTTG 


7740 


TAGCAGGAGA 


AGCTGCAAAA 


CTCTTGGGAC 


GTCCATTAGA 


AGACTTGAAG 


TTAATTACCT 


7800 


GTCATATTGG 


TAACGGAGGC 


TCAATTACAG 


CTGTGAAAGC 


CGGCAAATCT 


GTAGACACTT 


7860 


CTATGGGGTT 


CACTCCTCTT 


GGTGGTATTA 


TGATGGGAAC 


GCGTACAGGG 


GATATTGATC 


7920 


CAGCTATCAT 


TCCTTATTTA 


ATGCAATATA 


C AG AGG AT T T 


TAACACACCA 


G AAG AT AT C A 


7980 


GTCGTGTTCT 


TAACCGTGAA 


TCAGGTCTTT 


TGGGAGTTTC 


TGCTAATTCT 


AGCGATATGC 


8040 


GCGATATAGA 


AGCAGCTGTA 


GCAGAAGGGA 


ATCACGAGGC 


TAGCTTGGCT 


TATGAAATGT 


8100 


ATGTTGACCG 


TATCCAAAAA 


CATATCGGTC 


AGTACCTTGC 


AGTGCTAAAT 


GGAGCAGATG 


8160 


CCATTGTTTT 


CACAGCAGGT 


GTCGGTGAAA 


AT GC AG AG AG 


TTTCCGTCGT 


GATGTAATCT 


8220 


CAGGGATTTC 


GTGGTTTGGT 


TGTGATGTTG 


ATGATGAAAA 


GAATGTCTTT 


GGCGTTACAG 


8280 


GAGACATCTC 


AACAGAGGCA 


GCTAAAATCC 


GTGTCTTGGT 


TATTCCAACA 


GATGAAGAAT 


8340 


TAGTCATTGC 


CCGTGACGTT 


GAACGCTTGA 


AAAAATAAGT 


GAAACTAAAA 


AAAT AT T C AA 


8400 


TACAAGGAGT 


TGGGAAAGTT 


ATTTTTCCAG 


CTTCTTTTTC 


TGATGAAATT 


GTCCAAAACC 


8460 


TTGCTATGAT 


TGGCTTTTTT 


GAAAAATATG 


GTATAATAGT 


AGTAATTTAA 


TAG AT GG AGT 


8520 


TGAGTTTTGA 


AGAAAAACTT 


TCGTGTAAAA 


AGAGAGAAAG 


ATTTTAAGGC 


GATTTTCAAG 


8580 


GAGGGGACAA 


GTTTTGCTAA 


TCGCAAATTT 


GTGGTCTACC 


AATTAGAAAA 


CCAGAAAAAC 


8640 


CGTTTTCGAG 


TAGGTCTATC 


AGTTAGCAAA 


AAACTGGGGA 


ATGCCGTCAC 


TAG AAAT CAA 


8700 


ATTAAGCGAC 


GGATTCGGCA 


TATTATCCAG 


AATGCAAAAG 


GGAGTCTGGT 


AG AAG ATGT C 


8760 
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GACTTTGTTG TCATTGCTCG AAAAGGAGTC GAAACCTTGG GAT AC G C AG A GATGGAGAAA 8820 

AATCTACTCC ATGTATTAAA AT TAT C AAAG ATTTACCGGG AAGGAAATGG GAGTGAAAAA 8880 

GAAACTAAAG TTGACTAGTT TGCTAGGACT GTCTCTGTTA AT C ATG AC AG CCTGTGCGAC 8 94 0 

TAATGGGGTA AC T AGCG AT A TT AC AG C C G A ATCGGCTGAT TTTTGGAGTA AATTGGTTTA 9000 

CTTCTTTGCG GAAATCATTC GCTTTTTATC GTTTGATATT AGTATCGGAG TGGGG AT TAT 9060 

TCTCTTTACG GTCTTGATTC GTACAGTCCT CTTGCCAGTC TTTCAGGTGC AAATGGTGGC 912 0 

TTCTAGGAAA ATGCAGGAAG CTCAGCCACG CATTAAGGCG CTTCGAGAAC AATATCCAGG 9180 

T CGAG AT AT G GAAAGCAGAA C C AAAC TAG A GCAGGAAATG CGTAAAGTAT TTAAAGAAAT 92 4 0 

GGGTGTCAGA CAGTCAGACT CTCTTTGGCC GATTTTGATT CAGATGCCGG TTATTTTGGC 9300 

CCTGTTCCAA GCCCTATCAA GAGTTGACTT TTTAAAGACA GGTCATTTCT TATGGATTAA 93 60 

CCTTGGTAGT GTGGATACAA CCCTTGTTCT TCCGATTTTA GCAGCAGTAT TCACCTTTTT 9420 

AAGTACTTGG TTGTCCAACA AAGCTTTGTC TGAGCGAAAT GGCGCTACGA CTGCGATGAT 94 80 

GTATGGGATT CCAGTCTTGA TTTTTATCTT TGCAGTTTAT GCGCCAGGTG GAGTCGCCCT 9 540 

ATACTGGACA GTGTCTAATG CTTATCAAGT CTTGCAAACC TATTTCTTGA ATAATCCATT 9 6 00 

CAAGATTATC GCAGAGCGCG AGGCCGTAGT ACAGGCACAA AAAG ATT TGG AAAATAGAAA 9 660 

AAGAAAAGCC AAGAAAAAGG CTCAGAAAAC GAAATAAATA AGGAGGAATC TGGTAGTGGT 972 0 

AGTATTTACA GGTTCAACTG TTGAAGAAGC AAT C C AG AAA GGATTGAAAG AAT T AG AT AT 9780 

TCCAAGAATG AAGGCTCATA T C AAAGTC AT TTCTAGGGAG AAAAAAGGCT TTCTTGGTCT 9 840 

ATT T GGT AAA AAACCAGCCC AAGTGGATAT TGAAGCGATT AGTGAAACGA CTGTTGTCAA 99 00 

AGCAAATCAA CAGGTAGTAA AAGGCGTTCC GAAAAAAATC AATGATTTGA ACGAGCCTGT 9960 

GAAGACGGTT AGTGAAGAAA CCGTTGACCT TGGTCATGTG GTTGATGCTA TTAAAAAAAT 10020 

AGAGGAAGAA GGTCAAGGTA TTTCTGATGA AGTCAAGGCT GAAATCTTAA AAC AT G AAAG 1008 0 

ACATGCCAGC ACTATCTTAG AAGAAACTGG TCACATTGAG ATTTTAAATG AACTTCAAAT 10140 

CGAGGAAGCG ATGAGGGAAG AAGCAGGCGC TGATGACCTT GAAACTGAGC AAGACCAAGC 10200 

TGAAAGTCAA GAACTAGAAG ACTTGGGCTT GAAAGTTGAA ACGAACTTTG ATATTGAACA 102 60 

AGTAGCTACG GAAGTAATGG CTTATGTTCA AACGATTATT GATGACATGG ATGTTGAGGC 10320 

TACACTTTCA AATGATTATA ACCGTCGTAG CATCAATCTA CAAATTGACA CCAACGAACC 103 80 

AGGTCGTATT ATCGGCTACC AT GGT AAAGT CTTGAAGGCC TTGCAACTGT TGGCTCAAAA 10440 

TTATCTTTAC AACCGCTATT CCAGAACCTT CTACGTTACA ATCAATGTCA ATGATTATGT 10500 

CGAACACCGT GCAGAAGTCT TGCAGACCTA TGCGCAAAAA TTGGCGACTC GTGTTTTGGA 105 60 
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AGAAGGGCGC 


AGTCATAAAA 


CAGATCCAAT 


GTCAAATAGC 


GAACGCAAGA 


TTATCCATCG 


10620 


TATTATTTCA 


CGTATGGATG 


GCGTGACTAG 


TTACTCTGAA 


GGTGATGAGC 


CAAATCGCTA 


10680 


TGTTGTTGTA 


GATACAGAAT 


AAGTAAAATC 


AGGTTTATCC 


TGATTTTTTG 


CTAGTTAGAG 


10740 


GAGGTTAAAC 


TGATGTTGAA 


TAAGATAAGA 


GACTATTTAG 


ACTTTGCTGG 


TTTGCAGTAC 


10800 


CGTAATCCTG 


ATAAAGCGGG 


AGCAGAGCGA 


GAGAAGATGC 


TGGCATTCCG 


CCACAAAGGA 


10860 


CAAGAGGCCC 


GAAAGGTTTT 


TACAGAACTG 


GCCAAAGCCT 


TTCAAGCAAG 


CCATCCAGAA 


10920 


TGGCAACTCC 


AACAGACTAG 


CCAGTGGATG 


AATCAGGCCC 


AGCGTTTGAG 


ACCACATTTT 


10980 


TGGGTTTATC 


TACAGAGAGA 


CGGACAAGTG 


ACAGAACCTA 


TGATGGCCTT 


ACGTTTGTAT 


11040 


GGGACATCTA 


CTGACTTTGG 


AATTTCTTTG 


GAAGTCAGTT 


TCATCGAACG 


TAAGAAGGAT 


11100 


GAGCAAACAC 


TGGGCAAGCA 


GGCCAAAGTT 


T TAG AC AT TC 


CAACCGTTAA 


AGGGATTTAT 


11160 


TATCTAACCT 


ACTCTAATGG 


TCAAAGTCAA 


CGGTGGGAGG 


CGAATGAAGA 


AAAGCGTCGT 


11220 


ACTTTACGCG 


AGAAGGTGAG 


AAGTCAAGAA 


GTTCGAAAAG 


TTTTAGTGAA 


GGTAGATGTT 


11280 


CC T AT G AC AG 


AAAATTCGTC 


TGAAGAAGAA 


AT CGT AG AAG 


GCTTATTGAA 


GTCTTATTCT 


11340 


AAAATTCTTC 


CCTATTATCT 


AGCTACGAGA 


AAATAAGATA 


ATTTGTAAAA 


CAT C AT AAAT 


11400 


CATACAGTCC 


AAGAGTGAAC 


AGTCCGCTGT 


GTAATTCTTG 


GTCTTTTTGT 


TTGCGCTTTC 


11460 


GCATTATATA 


ATAAACTTAC 


AAAAACAATT 


CAAAAGGAGA 


AC AATT AT GG 


AAGTCGTTTC 


11520 


AAGTGTTCTA 


AATTGGTTTT 


CTAGCAATAT 


TTTGCAGAAT 


CCCGCATTTT 


TCGTAGGTTT 


11580 


ATTGGTGTTG 


ATAGGATATG 


CACTTTTGAA 


AAAACCTGCC 


CATGACGTTT 


TTTCAGGGTT 


11640 


TGTTAAAGCA 


ACAGTAGGGT 


ATATGTTGCT 


TAACGTGGGT 


GCTGGTGGTT 


TGGTTACAAC 


11700 


CTTTCGTCCA 


ATCTTAGCAG 


CTCTTAACTA 


CAAATTCCAA 


ATTGGTGCAG 


CGGTTATCGA 


11760 


CCCTTACTTT 


GGACTTGCTG 


CAGCAAACAA 


CAAAATTGTA 


GCAGAGTTTC 


C AG AT T T T GT 


11820 


TGGAACTGCA 


ACTACAGCTC 


T AT TG AT T GG 


TTTTGGAATA 


AATATCTTGC 


TCGTAGCTCT 


11880 


TCGAAAGATT 


ACGAAGGTAA 


GAACCCTCTT 


TATTACTGGT 


CACATCATGG 


T AC AAC AAG C 


11940 


TGCAACAGTA 


TCTCTTATGG 


TTCTATTCTT 


AGTACCACAA 


TTGCGCAATG 


CTTACGGTAC 


12000 


AGCAGCGATT 


GGTATCATCT 


GTGGACTTTA 


CTGGGCAGTT 


AGTTCAAATA 


TGACTGTTGA 


12060 


GGCAACTCAA 


CGCTTGACTG 


GTGGTGGCGG 


ATTTGCGATT 


GGTCACCAAC 


AGCAATTTGC 


12120 


AATCTGGTTT 


GTAGATAAAG 


TAGCAGGACG 


CTTTGGTAAG 


AAAGAAGAAA 


GTTTAGACAA 


12180 


TCTTAAATTA 


CCTAAGTTCC 


TCTCAATCTT 


CCACGATACA 


GTTGTTGCAT 


CTGCTACCTT 


12240 


GATGCTCGTA 


TTCTTCGGAG 


CCATTCTTTT 


AATCTTGGGT 


CC AG AC ATT A 


TGTCTAATAA 


12300 
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AGAAGTCATC ACTTCAGGAA CTCTATTCAA TCCTGCTAAA CAAGATTTCT TTATGTACAT 12 3 60 

TATCCAAACA GCCTTTACCT TCTCAGTTTA CTTGTTCGTT TTGATGCAAG GTGTCCGAAT 1242 0 

GTTCGTATCT GAGTTGACAA ACGCCTTCCA AGGTATTTCA AACAAATTGT TGCCAGGTTC 12480 

ATTCCCAGCG GTTGACGTTG CAGCTTCTTA TGGATTTGGT TCTCCAAATG CTGTCTTGTC 12 540 

AGGATTTACC TTTGGTTTGA TTGGTCAATT GATTACAATT GTTTTGCTCA TCGTCTTTAA 12 600 

AAATCCGATT CTTATTATTA CAGGATTTGT ACCAGTGTTC TTTGACAATG CAGCCATTGC 12 6 60 

GGTCTACGCT GATAAACGCG GCGGATGGAA AGCGGCTGTT ATCCTTTCCT TTATATCAGG 12 72 0 

TGTCCTTCAA GTTGCTCTAG GAGCTCTTTG TGTGGCCCTT CTCGATTTGG CATCTTATGG 12780 

TGGCTACCAT GGAAATATCG ACTTTGAATT CCCATGGCTT GGATTTGGAT ATATCTTCAA 12 84 0 

ATACCTTGGT ATTGTTGGTT ATGTACTTGT GTGTCTCTTC TTGCTTGTTA TTCCTCAACT 12 9 00 

TCAATTTGCC AAAGCAAAAG ATAAAGAGAA ATATTACAAC GGTGAAGTTC AAGAAGAAGC 12 9 60 

TTAGTATCTA GAAAAGGAGA AATAAAATGG TTAAAGTATT AGCAGCGTGC GGAAATGGAA 13 020 

TGGGTTCATC AATGGTTATC AAGATGAAGG TTGAAAATGC TCTCCGTAAG C TT AATC AAA 130 8 0 

CAGATTTTAC AGTCAATTCA TGCAGTGTCG GTGAAGCTAA AGGTTTAGCA GTAGGATATG 13140 

ACATCGTAAT CGCTTCTCTT CATTTGATTC AAGAATTGGA AGGGCGAACT AATGGGAAGT 13 2 00 

TAATTGGGCT TGATAACTTG ATGGATGATA AAGAAATCAC CGAAAAACTC AGTCAAGCAC 13 2 60 

TACAGTAAAA GGTTGGAGGG GGCTGGACAG AAACTGAGAG TTATCGTTTC TGTCCTTCTC 133 20 

CCTCTTTAAA TAAAGGAGGC AGATATGAAT TTAAAACAAG CTTTAATTGA CAATGACTCG 13 3 80 

ATCCGACTAG GTTTAGAGGC TAACAATTGG AAAGAAGCAG TCAAGGTAGC AGTAGATCCC 13 440 

TTAATTGAAA GTGGGGCAAT TTTGCCAGAG TATTACGATG CTATCATTGA ATCGACTGAA 13 500 

GAGTATGGGC CTTACTATAT CTTGATGCCA GGTATGGCTA TGCCCCACGC TAGACCTGAA 13 560 

GCAGGTGTGC AAAGTGATGC CTTTTCATTG ATTACCTTAC AAAATCCTGT TGTATTTTCA 13 62 0 

GATGGGAAAG AGGTATCTGT TTTGTTGGCA CTAGCAGCAA CAAGTTCAAA AATTCACACA 13 68 0 

AGTGTAGCCA TTCCACAAAT TATTGCCCTA TTTGAATTAG AAGATTCTAT TGCACGTTTA 13 740 

CAGGCTTGCC AGACTAAAGA AGATGTCTTG GC TAT G AT TG AAGAATCTAA GGATAGCCCT 13 800 

TATCTCGAAG G ATTGG AT TT GGAAAGTTAG AAAGAGGAAT AAAGAAATGA CAAAAAGAAT 13 8 60 

ACCTAATTTA CAAGTTGCAT TAGACCATTC AGACTTGCAA GGAGCGATTA AAGCAGCTGT 13 92 0 

TTCTGTTGGT CAGGAAGTAG ATATTATCGA AGCTGGAACT GTTTGCTTGC TTCAAGTTGG 139 80 

AAGTGAACTG GCTGAAGTCT TGCGTAGCCT TTTCCCAGAT AAGATTATTG TGGCAGACAC 14040 

AAAATGTGCT GATGCTGGTG GAACAGTTGC TAAAAATAAT GCGGTTCGTG GAGCAGACTG 14100 
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GATGACTTGT ATCTGTTGTG CAACCATCCC T ACT AT GG AA GCAGCTCTAA AGGCTATCAA 14160 

GACTGAACGA GGAGAACGAG GCGAAATCCA GATCGAGCTT TATGGCGATT GGACTTTTGA 142 20 

ACAAGCTCAG CTTTGGCTAG ATGCAGGTAT CTCACAAGCT ATTTATCACC AATCTCGTGA 14 2 80 

TGCTCTTCTT GCTGGTGAAA CTTGGGGTGA AAAAGACCTT AATAAGGTTA AAAAACTCAT 14 340 

TGACATGGGC TTCCGTGTAT CTGTAACAGG TGGTCTAGAT GTAGATACTC TCAAACTCTT 14400 

TGAAGGTATT GATGTCTTTA CCTTTATCGC AGGTCGTGGA ATTACAGAGG CTGTGGATCC 144 60 

AGCAGGAGCA GCGCGTGCCT TCAAGGATGA AATCAAACGA ATTTGGGGGT AAATCATGGT 14520 

ACGTCCAATT GGAATTTATG AAAAGGCAAC CCCAACACAC TGTACTTGGC TAGAACGTTT 14 5 80 

AAATTTTGCC AAGGAGTTAG GCTTTGATTT TGTCGAGATG TCTATTGACG AACGTGACGA 14640 

GCGTTTAGCA AGACTTGACT GGAGTAAGGA AGAACGCTTG GAAGTTGTCA AAGCAATCTA 147 00 

TGAAACTGGT GTTCGTATTC CTTCTATCTG TTTTTCAGGC CATCGTCGCT ACCCATTGGG 14 7 60 

TTCAAAAGAT CCAGTTCTAG AGGAAAAATC TCTAGAACTC ATGAAAAAAT GTATCGAATT 1482 0 

AGCTCAAGAC TTGGGAGTTC GTACGATTCA ATTAGCTGGT TACGATGTTT ACTATGAGGA 148 8 0 

AAAGTC AC CC CAGACACGCC AACGTTTTAT CAAAAATTTG AGAAAAGCCT GTGACTGGGC 14 940 

TGAAGAAGCT CAGGTGGTAC TTGCTATTGA AATTATGGAT GATCCTTTCA TCAGTAGCAT 15000 

CGAAAAATAT TTGGCTATAG AAAAAGAGAT TGACTCTCCC TTCCTCTTTG TATATCCAGA 15060 

TATTGGTAAT GTGTCTGCAT GGCATAATGA TATCTATAGT GAGTTTTATC TTGGTCATCA 15120 

TGCCATCGCA GCTCTCCATC TCAAGGATAC TTATGCAGTG ACAGAAAGTT CAAAGGGCCA 15180 

GTTCCGAGAT GTACCTTTCG GGCAAGGTTG TGTCAAATGG GAAGAAGCTT TCGATATTTT 1524 0 

AAAGGAAACC AATTATAATG GACCTTTCCT AATCGAAATG TGGTCTGAAA ATTGTGAAAC 153 00 

AGTAGAAGAA ACACGCGCAG CCATTCAAGA GGCGCAAGCT TTTCTCTATC C ACT CAT T AA 15360 

GAAAGCAGGT TTGATGTAAG ATGAATCAAG TAATCAATGC T ATG CGT AAA CGAGTCTGTG 1542 0 

ATGCCAATCA AT C ATTGC C A AAACATGGAC TTGTCAAATT TACCTGGGGG AATGTATCTG 154 80 

AAGTTAATCG CGAACTCGGT GTCATTGTTA T C AAAC CAT C AGGCGTGGAT TATGACGAAT 1554 0 

TGACACCTGA AAACATGGTA GTGACTGATC TAGATGGTAA GATCCTAGAA GGGGATTTAA 15600 

GACCATCTTC CGACCTCCCA ACTCATGTGC AATTATATAA GACTTGGTCA GAAATTGGTA 15 660 

GTGTGGTTCA CACCCATTCG ACAGAAGCTG TTGGTTGGGC TCAGGCAGGT CGTGATATTC 15720 

CTTTCTACGG AACAACCCAT GCAGATTATT TCTACGGTTC AATCCCTTGC GCCCGTAGTT 15780 

TGACCAAGGA CGAAGTAGAA GTGGCCTATG AAAAAGATAC TGGCCTGGTT ATCGTAGAAG 15 840 
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AGTTTGAACA 


TCGCGGACTT 


AACCCGGTTG 


AAGTACCAGG 


AATTGTTGTA 


CGCAATCACG 


15900 


GTCCATTCAC 


CTGGGGCAAA 


AATCCAGAGA 


ATGCTGTTTA 


TCACTCTGTC 


GTACTAGAGG 


15960 


AAGT AT C AAA 


GATGAATCGC 


TTTACAGAAC 


AAATCAATCC 


AAGAGTTGGA 


CCTGCTCCCC 


16020 


AGT AC AT AC T 


AGAAAAACAC 


TACCAACGTA 


AACATGGACC 


AAATGCTTAT 


TATGGTCAAA 


16080 


AGTAAGAACG 


ATGAAGGAGG 


AGAAAAAGAT 


AAATTTAGCT 


CCTCTTTTTA 


CAT T TG ATTT 


16140 


TTATTGAGAG 


TAAAGTTGGA 


GTTGAAGTAA 


TTTTAAAAGA 


TTTTTTAGAA 


ATAGCGCTTG 


16200 


ATATATATAT 


GGTAAAATAA 


AAAGAATTGC 


TGTGATATCA 


ATAGATTTGG 


GGGATTTTTT 


16260 


AATATGGTAC 


TGGATAAGGC 


AAGTTGTGAT 


TTGCTTCAAT 


ATTTGATGGA 


TCAAGAAACG 


16320 


TCCAAAACGA 


TTATGGCGAT 


TTCGAAAGAT 


TTGAAAGAGT 


CAAGAAGGAA 


AATTTATTAT 


16380 


CACATTGACA 


AAATCAATGC 


TGCTCTGGGT 


GACGAGGCGC 


TT C AC AT C AT 


TAGTATTCCA 


16440 


CGAATTGGTA 


TTCACTTAAC 


GGAAGAGCAG 


AGAGATGCTT 


GTTGTAAACT 


ATTATCGGAA 


16500 


GTAGATTCGT 


ACGATTATAT 


CATGAGTGCG 


CATGAACGTA 


TGATGATAAT 


GTTACTATGG 


16560 


ATAGGTATTT 


CTAAAGAACG 


TATTACGATT 


GAAAAATTGA 


TAGAGTTAAC 


AGAGGTATCT 


16620 


AGGAATACTG 


TTCTCAATGA 


TTTGAATAGT 


ATTCGTTATC 


AACTAACTTT 


GGAACAATAT 


16680 


CAGGTGATCT 


TGCAAGTGAG 


CAAGTCACAG 


GGATACAACC 


TTCATGCCCA 


CCCTCTTAAT 


16740 


AAAATTCAGT 


ATCTTCAATC 


GCTTCTATAT 


CATATTTTTA 


TGGAAGAAAA 


TGCCACTTTT 


16800 


GTATCTATTT 


TAGAAGATAA 


GATGAAAGAG 


AGGTTAGATG 


ATGAGTGTTT 


GCTTTCTGTT 


16860 


GAAATGAACC 


AATTTTTTAA 


GGAACAGGTT 


CCTTTAGTTG 


AACAAGATTT 


AGGGAAGAAA 


16920 


AT AAAC CAT C 


ATGAAATAAC 


TTTTATGTTG 


CAGGTTCTAC 


CTTATTTGCT 


GTTAAGCTGT 


16980 


CATAATGTTG 


AACAGTATCA 


AGAAAGACAT 


CAGGATATAG 


AGAAAGAATT 


TTCTTTGATA 


17040 


AGAAAAAGAA 


TAG AGT AT C A 


GGTGTCTAAG 


AAATTAGGAG 


AACGGTTGTT 


TCAAAAGTTT 


17100 


GAAATTTCTT 


TGTCAGGACT 


TGAAGTTTCT 


CTTGTAGCTG 


TTCTCCTCCT 


CTCCTATCGT 


17160 


AAAGATTTGG 


ATATTCATGC 


AGAAAGTGAT 


GATTTTCGGC 


AATTAAAACT 


TGCTTTAGAA 


17220 


GAATTTATCT 


GGTATTTTGA 


AT C AC AAATC 


CGAATGGAGA 


TTGAGAACAA 


GG AT G ATT T G 


17280 


TTACGAAATT 


TGATGATCCA 


CTGTAAAGCC 


TTGTTATTTA 


G AAAG ACT T A 


CGGTATTTTT 


17340 


TCTAAAAATC 


CTCTAACAAA 


ACAAATTCGA 


TCCAAGTATG 


GAGAATTATT 


TTTAGTCACT 


17400 


AGAAAATCTG 


CGGAAATTTT 


AGAAGGAGCA 


TGGTTTATTC 


GGCTAACAGA 


CGATGATATT 


17460 


GCCTATTTGA 


CG ATT CAT AT 


TGGAGGATTT 


TTAAAATATA 


CACCATCATC 


TCAAAAAAAT 


17520 


ATGAAAAAAG 


TTTATCTCGT 


TTGTGATGAA 


GGTGTTGCGG 


TTTCGAGACT 


TTTGCTGAAA 


17580 


CAATGCAAAC 


TTTATTTTCC 


AAATGAGCAA 


ATTGACACTG 


TATTTACAAC 


AGAACAATTT 


17640 
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AAGAGTGTGG 


AAGATATTGC 


ACAAGTTGAT 


GTAGTGATTA 


CTACTAATGA 


TG ATT T GG AT 


17700 


AGCAGATTTC 


CGATTTTAAG 


GGTTAATCCT 


ATCCTTGAAG 


CAGAAGATAT 


TTTGAAAATG 


17760 


CT AG AC TAT C 


TTAAACACAA 


TATATTTCGT 


AATAAGAGCA 


AAAGTTTCAG 


TGAAAATCTT 


17820 


TCTAGTCTTA 


TTTCGTCTTA 


TATTGTAGAC 


AGCAAGTTGG 


CTAGTAAGTT 


CCAAGAAGAG 


17880 


GTTCAAACAC 


TTATAAATCA 


AGAAATAGTA 


GTTCAAGCTT 


TTTTGGAAGr 


TATTTGAAGG 


17940 


ACAGTCCAAT 


GATGAACACA 


AACCTGTGTk 


TTTCsTGGTC 


TTTTtTAGTG 


TTTTGAAGGG 


18000 


TGG kAT ACT A 


ATCTCAAAGA 


TAACAATTAT 


ATCCAAAGGA 


GGCAACATAT 


GCCAAACGTC 


18060 


AAAGAAATTA 


CAAGAGAGTC 


ATGGATTTTA 


GCCACTTTCC 


CAGAGTGGGG 


AAC AT GGTTG 


18120 


AACGAAGAAA 


TCGAAGAAGA 


AGTCGTACCT 


GAAGGCAACT 


TTGCCATGTG 


GTGGCTAGGC 


18180 


AACTGTGGTA 


CT TGG ATT AA 


GACACCAGCT 


GGTGCTAACG 


TTGTCATGGA 


CCTTTGGTCA 


18240 


AACCGTGGAA 


AATCAACCAA 


AAAAGTGAAA 


GATATGGTTC 


GTGGGCACCA 


AATGGCAAAT 


18300 


ATGGCAGGTG 


TTCGTAAGCT 


GCAACCAAAC 


TTGCGTGTTC 


AGCCAATGGT 


TATCGATCCA 


18360 


TTTGCTATCA 


ACG AAC TAG A 


CTATTACTTA 


GT T T C AC AC T 


TCCACAGTGA 


TCATATCGAC 


18420 


C CAT AC AC AG 


CTGCAGCAAT 


TCTCAATAAT 


CCTAAGTTAG 


AGCATGTTAA 


GTTGG 


18475 



(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7186 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 39: 

CCAGGATTTG GTACCGTTGC AAGT GGTGTG CCTTTCCTCC TAAAGGAAAA TGGAGGAAAA 60 

ATCAATCAAT CAGCACATTC AGATATCAAA GTTGCTAAGG TATTGGTCAA GGATGAAGAT 120 

GAAAAAAATC GCTTGCTTGC AGCAGGGAAT GACTTTAACT TTGTAACCAA TGTGGATGAT 180 

ATTTTATCAG ACCAGGATAT TACT AT CGT A GTGGAATTGA TGGGGCGTAT TGAGCCTGCT 240 

AAAACCTTTA TCACTCGTGC CTTGGAAGCT GGAAAACACG TTGTTACTGC TAACAAGGAC 300 

CTTTTAGCTG TCCATGGCGC AGAATTGCTA GAAATCGCTC AAGCTAACAA GGTAGCACTT 3 60 

TACTACGAAG CAGCAGTTGC TGGTGGGATT CCAATTCTTC GTACTTTAGC AAATTCCTTG 42 0 

G C T TC TG AT A AAATTACGCG CGTGCTTGGA GTAGTCAACG GAACTTCCAA CTTCATGGTG 480 

ACCAAGATGG TGGAAGAAGG CTGGTCTTAC GATGATGCTC TTGCGGAAGC ACAACGTCTA 54 0 
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GGATTTGCAG AAAGCGATCC GACGAATGAC GTAGATGGGA TTGATGCAGC CTACAAGATG 600 

GTTATTTTGA GCCAATTTGC CTTTGGCATG AAGATTGCCT TTGATGATGT AGCCCACAAG 660 

GGAATCCGCA AT AT C AC AC C AGAAGACGTA GCTGTAGCTC AAGAGCTTGG TTACGTAGTG 72 0 

AAATTGGTTG GTTCTATTGA GGAAACTTCT TCAGGTATTG CTGCAGAAGT GACTCCAACC 7 80 

TTCCTACCTA AAGCGCACCC ACTTGCTAGT GTGAATGGCG TAATGAACGC TGTCTTTGTA 84 0 

GAATCTATCG GTATTGGTGA GTCTATGTAC T AC GG AC C AG GTGCGGGTCA AAAACCAACT 9 00 

GCAACAAGTG TTGTAGCTGA TATTGTCCGT ATCGTTCGTC GTTTGAATGA TGGTACTATT 9 60 

GGCAAAGACT TCAACGAATA TAGCCGTGAC TTGGTCTTGG C AAATCCT G A AGATGTCAAA 102 0 

GCAAACTACT ATTTCTCAAT CTTGGCTCTA GACTCAAAAG GTCAGGTCTT GAAGTTGGCT 1080 

GAAATCTTCA ATGCTCAAGA TATTTCCTTT AAGCAAATCC TTCAAGATGG CAAAGAGGGT 1140 

GACAAGGCGC GTGTCGTTAT CATCACACAC AAGATTAATA AAGCCCAGCT TGAAAATGTC 12 00 

TCAGCTGAAT TGAAGAAGGT TTCAGAATTC GACCTCTTGA ATACCTTCAA GGTGCTAGGA 12 60 

GAATAAGATG AAGATTATTG TACCTGCAAC CAGTGCCAAT ATCGGGCCAG GTTTTGACTC 132 0 

GGTCGGTGTA GCTGTAACCA AGTATCTTCA AATTGAGGTC TGCGAAGAAC GAGATGAGTG 13 80 

GCTGATTGAA CACCAGATTG GCAAATGGAT TCCACATGAC GAGCGTAATC TCTTGCTCAA 144 0 

AATCGCTTTG CAAATTGTAC CAGACTTGCA ACCAAGACGC TTGAAAATGA CCAGTGATGT 1500 

CCCTTTGGCG CGCGGTTTGG GTTCTTCCAG CTCGGTTATC GTTGCTGGGA TTGAACTAGC 15 60 

CAACCAACTG GGTCAACTCA ACTTATCAGA CCATGAAAAA TTGCAGTTAG CGACCAAGAT 162 0 

TGAAGGGCAT CCTGACAATG TGGCTCCAGC CATTTATGGT AATCTCGTTA TTGCAAGTTC 1680 

TGTTGAAGGG CAAGTCTCTG CTATCGTAGC AGACTTTCCA GAGTGTGATT TTCTAGCTTA 174 0 

CATTCCAAAC TATGAATTAC GTACTCGCGA CAGCCGTAGT GTCTTGCCTA AAAAATTGTC 1800 

TTATAAGGAA GCTGTTGCTG CAAGTTCTAT CGCCAATGTA GCGGTTGCTG CCTTGTTGGC 18 60 

AGGAGACATG GTGACCGCTG GGCAAGCAAT CGAGGGAGAC CTCTTCCATG AGCGCTATCG 1920 

TCAGGACTTG GTAAGAGAAT TTGCGATGAT TAAGCAAGTG ACCAAAGAAA ATGGGGCCTA 19 80 

TGCAACCTAC CTTTCTGGTG CTGGGCCGAC AGTTATGGTT CTGGCTTCTC ATGACAAGAT 2040 

GCCAACAATT AAGGCAGAAT TGGAAAAGCA ACCTTTCAAA GGAAAACTGC ATGACTTGAG 2100 

AGTTGATACC CAAGGTGTCC GTGTAGAAGC AAAATAAAGA ATAGAAGATA GGATGGGGAA 2160 

ACTCTTGACC AGAGGGGTTC ATATCCTTTT TGTGAAAAGA AGTTTATACT CAATGAAAAT 2220 

CAAAGAGCAA ACTAGGAAGC TAGCCGCAGG CTGCTCAAAA CAGTGTTTTG AGGTTGCAGA 22 8 0 

TAGAACTGAC GAAGTCAGCT CAAGACACTG TTTTGAGGTT GCAGATAGAA CTGACGAAGT 2340 
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C AGT AAC CAT ACTACGGTAA GGTGACGCTG ACGTGGTTTG AAGAGATTTT CGAAGAGTAT 24 00 

TAG T T AAAAA CGTGATAAAG GAGAAATAAA GATGGCAGAA ATTTATCTAG CAGGTGGTTG 2460 

TTTTTGGGGC CTAGAGGAAT ATTTTTCACG CATTTCTGGA GTGCTAGAAA CCAGTGTTGG 2 52 0 

CTACGCTAAT GGTCAAGTCG AAACGACCAA TTACCAGTTG CTCAAGGAAA CAGACCATGC 25 80 

AGAAACGGTC CAAGTGATTT ACGATGAGAA GGAAGTGTCA CTCAGAGAGA TTTTACTTTA 2 640 

TTATTTCCGA GTTATCGATC CTCTATCTAT CAATCAACAA GGGAATGACC GTGGTCGCCA 2 700 

ATATCGAACT GGGATTTATT ATCAGGATGA AGCAGATTTG CCAGCTATCT ACACAGTGGT 2 7 60 

GCAGGAGCAG GAACGCATGC TGGGTCGAAA GATTGCAGTA GAAGTGGAGC AATTACGCCA 2 82 0 

C T AC AT T CT G GCTGAAGACT ACCACCAAGA CTATCTCAGG AAGAATCCTT CAGGTTACTG 2880 

TCATATCGAT GTGACCGATG CTGATAAGCC ATTGATTGAT GCAGCAAACT ATGAAAAGCC 2 94 0 

TAGTCAAGAG GTGTTGAAGG CCAGTCTATC TGAAGAGTCT TATCGTGTCA CACAAGAAGC 3 00 0 

TGCTACAGAG GCTCCATTTA CCAATGCCTA TGACCAAACC TTTGAAGAGG GGATTTATGT 3 0 60 

AGATATTACG ACAGGTGAGC CACTCTTTTT TGCCAAGGAT AAGTTTGCTT CAGGTTGTGG 312 0 

TTGGCCAAGT TTTAGCCGTC CGATTTCCAA AG AGT TG AT T CATTATTACA AGGATCTGAG 3180 

CCATGGAATG GAGCGAATTG AAGTTCGTTC TCGTTCAGGC AGTGCTCACT TGGGTCATGT 3 2 40 

TTTCACAGAT GGACCGCGGG AGTTAGGCGG CCTCCGTTAC TGTATCAATT CTGCTTCTTT 3 300 

ACGCTTTGTG GCCAAGGATG AGATGGAAAA AGCAGGATAT GGCTATCTAT TGCCTTACTT 3 3 60 

AAACAAATAA AACAGAGAGT GGGGCTTCCC ACTTTCTTCA TTTCTAGAAT ATGAATAGAA 342 0 

GGGATTTATG AAACACCTAT TATCTTACTT CAAACCCTAC ATCAAGGAAT CAATTTTAGC 34 80 

CCCCTTGTTC AAGCTGTTAG AAGCTGTTTT TGAGCTCTTG GTTCCCATGG TGATTGCTGG 3 54 0 

GATTGTTGAC CAATCTTTAC CTCAGGGAGA TCAAGGTCAT CTCTGGATGC AGATTGGCCT 3 600 

GCTCCTTATC TTTGCAGTAA TTGGCGTTTT AGTGGCCTTG ATAGCTCAAT TTTACTCAGC 3 6 60 

AAAGGCAGCA GTAGGTTCTG CTAAGGAATT GACAAACGAT CTTTATCGTC ATATTCTTTC 372 0 

CTTGCCCAAG GACAGCAGAG ACCGTCTGAC AACTTCTAGT TTGGTCACTC GCTTGACTTC 37 8 0 

GGATACCTAC CAGATTCAGA CTGGTATCAA TCAATTCCTG CGTCTCTTTT TACGAGCGCC 3 84 0 

CATTATCGTT TTTGGTGCCA TTTTTATGGC TTATCGAATC TCAGCTGAGT TGACTTTCTG 3 900 

GTTCTTAGTC TTGGTTGCCA TTTTGACCAT TGTCATTGTA GGGTTATCTC GATTGGTCAA 3 96 0 

TCCTTTCTAC AGT AGT CT C A GAAAGAAAAC GGACCAACTG GTT C AGG AAA CGCGCCAGCA 4 02 0 

ATTGCAAGGG ATGCGGGTTA TTCGTGCTTT TGGTCAAGAA AAACGAGAGT TACAGATTTT 4 08 0 
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TCAAACCCTT AACCAAGTTT ATGCTAGATT ACAAGAAAAG ACAGGTTTCT GGTCTAGTTT 4140 

ATTAACACCT CTGACCTATC TGATTGTCAA TGGAACTCTT CTCGTTATTA TCTGGCAAGG 4200 

CTATATTTCA ATTCAAGGAG GAGTGCTCAG TCAAGGTGCT CTCATTGCTC TTATCAATTA 4 2 60 

CCTCTTACAG ATTTTGGTGG AATTGGTCAA GCTAGCCATG TTGATCAATT CCCTCAACCA 4320 

GTCCTATATC TCAGTCAAGC GAATCGAGGA AGTCTTTGTT GAGGCTCCAG AGGATATCCA 43 80 

TTCAGAGTTA GAACAAAAGC AAGCTACCAG AGATAAGGTT TTACAAGTCC AAGAATTGAC 444 0 

CTTTACCTAT CCTGATGCGG CCCAGCCTTC TCTGAGATAC ATTTCCTTTG AT AT G ACTC A 4500 

AGGACAAATT CTAGGTATCA TCGGGGGAAC TGGTTCTGGT AAATCAAGCT TGGTGCAACT 4560 

CTTACTTGGA CTTTATCCAG TAGACAAGGG GAACATTGAC CTTTATCAAA ATGGACGTAG 4620 

TCCTCTTAAT TTGGAGCAGT GGCGGTCTTG GATTGCCTAT GTACCTCAAA AGGTCGAACT 4680 

CTTTAAAGGA ACCATTCGTT CCAACTTGAC TCTAGGTTTC AATCAAGAAG TATCTGACCA 4740 

GGAACTCTGG CAGGCCTTGG AGATTGCGCA AGCTAAGGAT TTTGTCAGTG AAAAGGAAGG 4800 

ACTCTTGGAT GCTCTAGTTG AGGCAGGGGG GCGAAATTTC TCAGGTGGAC AAAAACAAAG 4860 

ATTGTCTATC GCCCGAGCAG TCTTGCGCCA GGCTCCGTTT CTCATCCTAG ATGATGCAAC 4 92 0 

CTCGGCACTG GATACCATTA CAGAGTCCAA GCTCTTGAAA GCTATTAGAG AAAATTTTCC 4 980 

AAACACGAGC TTAATTTTGA TCTCTCAACG AACCTCAACT TTACAGATGG C G G AC C AG AT 5040 

TCTCCTCTTG GAAAAAGGTG AGTTGCTAGC TGTTGGCAAG CACGATGACT TGATGAAATC 5100 

CAGCCAAGTC TATTGTGAAA TCAATGCATC CCAACATGGA AAGGAGGACT AGAATGAAAC 5160 

GACAAACTGT AAAC C AG AC G CTCAAACGTT TAGCCGTAGA TTTAGCAAGC CATCCTTTCC 522 0 

TCCTTTTCCT AGCCTTTCTA GGAACTATTG CCCAAGTTGG CTTATCAATT TACCTACCTA 52 80 

TTCTGATTGG GCAGGTCATT GACCAAGTCC TAGTGGCTGG TTCATCACCA GTTTTTTGGC 5340 

AGATTTTTCT C C AG AT G CT C TTGGTGGTAA TAGGAAATAC TCTGGTACAA TGGGCCAATC 5400 

CTCTCCTCTA TAATCGTCTA ATCTTCTCTT ATACCAGAGA TTTACGGGAG CGAATCATCC 5460 

ATAAGCTCCA TCGTTTACCG ATTGCCTTTG TAGATAGGCA AGGTAGTGGA GAGATGGTTA 552 0 

GTCGTGTAAC CACGGACATC GAACAGTTGG CAGCTGGCTT GACCATGATT TTTAACCAAT 5 580 

TTTTCATTGG TGTTTTGATG ATTTTGGTCA GTATTCTAGC CATGCTCCAA ATTCATCTCC 5640 

TCATGACTCT CTTAGTCTTG CTGTTGACGC CACTGTCCAT GGTGATTTCA CGCTTTATTG 5700 

CCAAGAAATC CTATCATCTC TTCCAGAAGC AAACAGAGAC GAGGGGAATT CAGACTCAGT 5760 

TGATTGAAGA ATCGCTTAGT CAGCAGACTA TAATCCAGTC CTTCAATGCT CAAACAGAAT 5820 

TTATCCAAAG ATTGCGTGAG GCTCATGACA ACTACTCAGG CTATTCTCAG TCAGCCATCT 5880 
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TTTATTCTTC AACGGTCAAT CCTTCGACTC GCTTTGTAAA T GC AC T CAT T TATGCCCTTT 5940 

TAGCTGGAGT AGGAGCTTAT CGTATCATGA TGGGTTCAGC CTTGACCGTC GGTCGTTTAG 6000 

TGACTTTTTT GAACTATGTT CAGCAATACA CCAAGCCCTT T AACG AT AT T TCTTCAGTGC 60 60 

TAGCTGAGTT GCAAAGTGCT CTGGCTTGCG TAGAGCGTAT CTATGGAGTC TTAGATAGCC 612 0 

CTGAAGTGGC TGAAACAGGT AAGGAAGTCT TG AC G AC C AG TGACCAAGTT AAGGGAGCTA 6180 

TTTCCTTTAA ACATGTCTCT TTTGGCTACC ATCCTGAAAA AATTTTGATT AAGGACTTGT 6240 

CTAT C GAT AT TCCAGCTGGT AGTAAGGTAG CCATCGTTGG TCCGACAGGT GCTGGAAAAT 6300 

CAACTCTTAT CAATCTCCTT ATGCGTTTTT ATCCCATTAG CTCGGGAGAT ATCTTGCTGG 63 60 

ATGGGCAATC CATTTATGAT TATACACGAG TATCATTGAG ACAGCAGTTT GGTATGGTGC 642 0 

TTCAAGAAAC CTGGCTCACA CAAGGGACCA TTCATGATAA TATTGCCTTT GGCAATCCTG 6480 

AAGCCAGTCG AGAGCAAGTA ATTGCTGCTG CCAAAGCAGC TAATGCAGAC TTTTTCATCC 654 0 

AACAGTTGCC ACAGGGATAC GATACCAAGT TGGAAAATGC TGGAGAATCT CTCTCTGTCG 6600 

GCCAAGCTCA GCTCTTGACC ATAGCCCGAG TCTTTCTGGC TATTCCAAAG ATTCTTATCT 6 660 

TAGACGAGGC AACTTCTTCC ATTGATACAC GGACAGAAGT G C TGGT AC AG GATGCCTTTG 672 0 

CAAAACTCAT GAAGGGCCGC ACAAGTTTCA TCATTGCTCA CCGTTTGTCA ACCATTCAGG 6780 

ATGCGGATTT AATTCTTGTC TTAGTAGATG GTGATATTGT TGAATATGGT AACCATCAAG 6840 

AACTCATGGA TAGAAAGGGT AAGTATTACC AAATGCAAAA AGCTGCGGCT TTTAGTTCTG 6900 

AATAAGCCAT TCTCTTTTGA AAGTTTATGG ACGAAAAAAG TTGCCTTCGA GTGACTTTTT 696 0 

TGTTACAATA GCTAGAAAAA TTGTTCACTG TAATACTCAA TGAAAATCAA AGAGCAAACT 7 02 0 

AGGAAGCTAG CCGTAGGTTG CTCAAAGCAC AGCTTTGAGG TTGTAGATAA GACTGACGAA 7 080 

GTCAGTTCAA AACACTGTTT TGAGGTTGCA GATAGAACTG ACGAAGTCAG CTCAAAACAC 714 0 

TGTTTTGAGG TTGCAGATAG AACTGACGAA GTCAGCTCAA AACAGG 718 6 
(2) INFORMATION FOR SEQ ID NO : 40: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14273 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

CTGAAAATTC TAAAAAATTT ATAAGTAAGG AATTAATTAG TTATTTTTGT GATAAAGTTT 60 
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ATGATGAAAT ATTTGTTGAA GAGGTAGTTC CGCACGTTTT TCTGCCATAT GAATCTGACT 12 0 

TACTTCTTAT TTTACCAGCT ACGGCAAATG TGATTGGCAA AATTGCTAAT GGTATTGCTG 180 

ATGATTTAGT TACAGCAACT GTTTTAAACT TTAATAAAAA AATAATTTTT TGTCCCAATA 24 0 

TGAACTCTAC TATGTGGGAC AATCACATAG TTCAAAGAAA TGTATCAATT CTAAAGGAGT 3 00 

TGGGACATAT ATTTTTATTT GAGTCTAAAA AAAC AT AT G A GGTAGGATTG CGTAAAGCAA 3 60 

TAGATTCAAC ATGTTCAATG TTACAACCAC AGTCGTTAGT AAAAGAACTT ATCAAATTAG 42 0 

AAAATATTGT CCTTGAAGAG GG AC AT T AAA AACTACTGAG AATATTAATG AGGGGAAAAA 480 

ATGGAAAATT CATCAATCGA TGTAGATATG CTGTTGGAAG AATT G AC AC A AGAAGCAATG 54 0 

GTCGTTGTTG CTGTTGATAA GGACTGTTAA TTTAAACTTA TGGCAATATA TGAAAGGTTA 600 

CTGGATGTTT TAAATTATGC AGGCAGTAGC CTTTTATTAT ATACAAATGG ATAAAGTAAG 660 

GATAATACAA TGATTAATAA AAAAATACAA CAAGTTGTTT TGGAATCATT ACAGAATTTT 72 0 

TTGAATGGGA ACTTCATTTC GCCTTGTGTA GTCTATGATT TTGGCTTGCT GGAAACTGTA 780 

CTTGATGAAT T T AAAAATC A AATTCCTGTA ACATTCAATT ACCAACTTTT TTATGCCGTT 840 

AAAGCAAATT CAAATGAGAA GAT ACT T G AA TTCTTAGTAG AT AAAAT T G A TGGAGTTGAT 9 00 

GTGGCGTCAT TATCTGAATT AGATGTGGCT AAAAAATTTT TCCCACCAAC TCAAATTTCT 9 60 

GTTAATGGTC CCGCATTTTC TTATGAAACT TTATATAATC TGATTAAAAA ACAATATAAA 1020 

GTTGATATTA ACTTTTTGGA ACATCTTCAA CAATTTTCCC CAAAAGAATC TGTTGGAATA 1080 

AGAGTAACGG AGCCAGATGA ACT TAATAAT CGTATGAGTC GATTTGGAAT AAATATTTGC 114 0 

AGTGATAATT GGACTAGTAA T TT AC AAAAT CCTTTAATTA CACGACTGCA TTTTCATTTT 1200 

GGAGAAAAAG ATGATAAATT TATTGTTAAG TTAGATAAAA TAT TAT T T AA GTTACAAGAA 1260 

ATTAATAAAC TTAGAGAGGT TAGAGAAATA AATCTTGGAG GCGGTTTTAT GAAATTATTT 13 2 0 

ATGG AAAAT C GTTTGAAAGA ATTTTTTCTA TCACTTATGG AAATCTATAA AAAGTACGAT 13 80 

AT TG AT AGT A CTGTGACTAC AATAATAGAA CCAGGTAGTG CAATTACTTC ATTTTCTGCC 144 0 

TATATGATTA CTAGCCCAGT T AATGT T AGT GAGGTGAATG AGCAGCAGGT TATCACGTTA 15 00 

GACACATCAA TAT AC AC C AA TACATTATGG TTTGTTCCGC ATATTATTAC AACGTTAAAT 156 0 

TCAAGTAGTA AAGAGCGTTA TAGTACTATT CTCTATGGTA ATACCTGTTA TGAACATGAC 162 0 

AAGTATAAAA TGAAAGTTTC GCTTCCAAGG TT AACT C AAA ATAGCAGTAT AGTGTTTTTT 1680 

CCTGTAGGAG CTTATATAAA AAGC AAT CAT T C AAATTT AC ATCGTAATGA TTTTATGCGG 174 0 

GAGGTATATT TGTGGACAAA AAACTTGACA TATTAGATAA AGTTAAGGAA TATTTAGGAA 1800 

ATAAAACTAC TCAAATTCTG GATAATCAAT ATAAAGAATT TTTGAAACTT AATGATATAA 1860 
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GGCGAGCGTT TGGTATTTCA GAAAAAGTAT TAAACAATTC TTTTAATTTT AC G AG T AAAG 192 0 

AATTTAATGA . TTTAATTAAT AACGAAAATT ATTTATTCGA ATATGCATGT AGAATTAGAG 1980 

AGGAATGGAG AAAAAAATGC TTTAATCATT CTTATCGTTT TCTATGCTCA CCTATAATTA 2 04 0 

CAGATGATTT TCTTAACACG AAGACATTGA GAAGTAGCCA AATTGAATAT AAATATGAGC 2100 

GATATTTATC GAAAAGTTCG ATAGGCGATA GAGCGGTTGA TGGCTTTGTT TCCTTCAATA 2160 

CTTTAACAGC TAATGGTATG TCTGCTATTA AACTATGTCT TGAGATATTA AACTCTATTT 2220 

TCTTCAAGAA GAAGATTGAT TT ATT AT ATT CAACCGGATA TTATGAAACA AGATTTTTAT 2280 

TAAATAATCT TGCTAAATCA GGTATTAGTT GCTATGAGGT AAGTAATTGT GAATTGGATA 2 3 40 

AAGATAAATT TTATAATGTA TTCATGATGG AACCCAATCG AGCCGATTTA ACATTACAAA 2400 

AAACTG AT TT CAAGATAGTA GAATATTTTG TTAAGTATAA AAATAATTCA ATAAAAGTCG 2460 

TTATTTTAGA TATTTCATAT CAAGGTTCTA ATTTTAAATT AGTAGAATTT T T AG AG AAAT 2 52 0 

TTAAATTTGC GAATGTAATT ATTTTTGTGG TACGATCTTT GATAAAATTA GATCAAATGG 2 580 

GATTAGAATT GACAAATGGG GGAATAATAG AAGT GTTT AT TCCTAATCAT TTGAGAAAGT 2 64 0 

TGAAAAATTT TATTGAAGAG GAATTCAATA AAT T TAG AAA TTCTCACGGA GCTAATCTAA 2 700 

GCCTCTATGA ATACTGTTTG CTTGATAATT CTTTAACTTT AAAAAATGAT TGGAACTATT 2 7 60 

CTGATTTAGT TATGAAATTT ACGAGTAATT TTTATGCTGA TATAAAAGAC TTGTTCATGG 2 82 0 

AAAATTCTGA TATTGAAATC ATCCATGAAG AGGGAGTACC TTTTGTATTT TTAGATTTAA 2 880 

TAGGTGAAGG TAAAAAAGAA TATGAAATGT TTTTTCAATG GTTAAACTTC TTTTACAAAC 2 940 

AGCTTGGAAT CACATTGTAT GCTAGAAATA GTTTTGGGTT TCGGAATCTA ACAGTAGAGT 3 000 

ATTTTGGAAT TATTGGGACA GAAAGATATA TATTTAAGAT TTGTCCAGGT GT T TAT AAAG 3 060 

GGTTAAGTTA TTATTTGATG AAATTTTTAT TAAAATCTTT TTCAAATGAA TATTTAAAAA 312 0 

CTACTGATGA GGTTAATAGA TGAAAAATTT GATAAAGTTG CTAATAATTA GATTGATTGT 3180 

TAACTTAGCA GACAGTGTAT TTTATATAGT AGCATTGTGG CACGTTAGCA ATAATTATTC 3 24 0 

TTCGAGCATG TTCTTAGGAA TATTTATTGC AGT AAAT TAT CTACCGGATT TGT TACT AAT 3 3 00 

CTTTTTTGGA CCAGTTATTG ACAGAGTAAA TCCGCAAAAA ATT C TT AT AA TATCAATTTT 3 3 60 

GGTTCAATTA GCAGTGGCTG TAATATTTTT ATTATTATTA AACCAAATAT CATTTTGGGT 3 42 0 

GATAATGAGT CTAGTGTTTA TTTCAGTAAT GGCTAGCTCC ATAAGTTACG TGATAGAAGA 34 80 

TGTGTTGATT CCTCAAGTGG TAGAATATGA TAAGATTGTA TTTGCAAATT CTCTTTTTAG 354 0 

TATTTCGTAT AAAGTATTAG ATTCTATTTT TAATTCATTC GCATCATTTT TACAGGTGGC 3 600 
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AGTAGGATTT ATTTTATTGG TTAAGATAGA TATAGGCATA TTTTTACTTG CTCTATTTAT 3 6 60 

ATTGTTGTTG TTAAAATTTA GAACTAGCAA TGCGAATATA GAAAACTTCT CTTTCAAATA 372 0 

TTACAAGAGA GAAGTGTTGC AAGGTACAAA GTTTATTTTA AATAATAAAT TATTATTTAA 37 80 

AACCAGTATT TCTTTAACGC TTATAAACTT TTTTTATTCA TTTCAGACAG TAGTTGTACC 3 840 

GATTTTTTCT ATTCGATATT TTGATGGTCC GATTTTTTAT GGTATTTTTT TAACTATTGC 3 900 

TGGTTTGGGT GGTATATTGG GAAATATGCT AGCGCCAATC GTAATAAAAT AT TT AAAATC 3 9 60 

GAATCAAATT GTTGGTGTAT TTCTTTTTTT GAACGGCTCA AGTTGGT TAG TAGCAATTGT 4020 

TATAAAAGAC TATACTTTAT CACTTATTTT ATTTTTCGTT TGTTTTATGT CTAAAGGAGT 40 8 0 

CTTCAATATT ATTTTTAATT CGTTGTACCA AC AAAT AC C T CCACATCAAC TTCTTGGTAG 414 0 

GGTAAATACT ACCATTGATT CTATTATTTC TTTTGGAATG CCAATTGGTA GTTTAGTTGC 42 00 

AGGAACGCTT AT TG AT TT G A ATATTGAATT AGTGTTAATT GCTATTAGCA TACCTTATTT 42 60 

TTTGTTTTCT TATATTTTTT ATACGGATAA TGG AT TG AAA GAATTTAGTA TAT AT T AG AA 4320 

ATGTTTATGT TCATTCAAAA G C AT AATG AC TATAACTGAA AAAGAAAAGT GATATCTTTA 4 3 80 

AGGTTGTTCT TCTTGGTGGT GAGATTCGTG AGACAACCCA AGCTTTTGTC GGAAAGATTA 444 0 

CCAATGCTTT GATGGATAGG ATGTACTTTA GCAAGATGTT TTTAGTGGTA ACGGTATCGT 4500 

GGATGGACGT GTAATAACCT CTTCTTTCGA GGAGTATTTT ACTAAAAAAC TAGCCTTGGA 4 5 60 

GCGTTCCCCA GAAACGGACT TACTCATTGA CTCTTCAAAG ATTTGGGGAG AAGATTTTGC 4 62 0 

TTCATCTGTT CCTTGAAAAA AGTCACAGCA GTCATCACAG ACGATAGTAC TGAACAAAAC 4 680 

TATGAAGAGT TAGAAATTTA TACGCAGGTG ATTGTATAAA GGATCTGGAA ATAGATAAGA 4740 

AGTTGATTAG TATTGACCTA GGTGGTACAA ATATTAAGAT TACTGTTCTT TCAAATGACG 4 8 00 

GTGAGATTGA AACTTTGTGG AGT AT T AC AA CAGATACAAG TGAGAAAGGT TCTCAAATTA 4860 

T AT CGG AC AT CATCAGTTCT ATTAAAAATA AATTGACCGA ACGGAATATT CCTGATAGCG 4 92 0 

ACCTTCTTGG AATCGGTATG GGAAGTTGCT CATCATACTT TCCTTGTAAA TCATAGGGGC 4980 

TATAAACTCT CCGTCTACTT GTCCTGCAAC AATTGAAGTC TGCTCAAAAC GCCGTCCGCT 5 040 

AATCTTTTCA TAGACTTTCT CCCTTTTAGG AGCCTAGCTT TCTAGTTTGT TCTTTGATTT 5100 

TTATTGAGTA TACCACTATT TTACTCCCTC TGGCAAGGGA CTTTGTCTAT GTGGAGGGAT 516 0 

TGGGCTCCTA TGTGGTGGAG CTTTTCTGTT CTTTCTGAAA TATGGTATAA TAGCACTAAT 522 0 

CAATTTCTAG GAAAATAGAT ACAGAAAGGG G CTGAAAG AT GTCTCATATT ATTGAATTGC 5280 

CAGAGATGCT GGCAAACCAA ATCGCGGCTG GAGAGGTCAT TGAACGTCCT GCCAGTGTGG 5340 

TCAAAGAGTT GGTAGAAAAT GCCATTGACG CGGGCTCTAG TCAGATTATC ATTGAGATTG 5400 
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AGGAAGCTGG TCTCAAGAAG GTTCAAATCA CGGATAACGG TCATGGAATT GCCCACGATG 54 60 

AGGTGGAGTT GGCCCTGCGT CGCCATGCGA CCAGTAAGAT AAAAAATCAA GCAGATCTCT 5520 

TTCGGATTCG GACGCTTGGT TTTCGTGGTG AAGCCTTGCC TTCTATTGCG TCTGTTAGTG 558 0 

TCTTGACTCT GTTAACGGCG GTGGATGGTG CTAGTCATGG AACCAAGTTA GTCGCGCGTG 564 0 

GGGGTGAAGT TGAGGAAGTC ATCCCAGCGA CTAGTCCTGT GGGAACCAAG GTTTGTGTGG 57 00 

AGGATCTCTT TTTCAACACG CCTGCCCGTC TCAAGTATAT GAAGAGCCAG CAAGCGGAGT 57 6 0 

TGTCTCATAT CATTGATATT GTCAACCGTC TGGGCTTGGC CCATCCTGAG ATTTCTTTTA 5820 

GCTTGATTAG TGATGGCAAG GAAATGACGC GGACAGCAGG GACTGGTCAA TTGCGCCAAG 5880 

CAATCGCAGG GATTTACGGT TTGGTCAGTG CCAAGAAGAT GATTGAAATT GAGAACTCTG 5940 

ACCTAGATTT CGAAATTTCA GGTTTTGTGT CCTTGCCTGA GTTGACTCGG GCTAACCGCA 6000 

ATT AT AT C AG CCTCTTCATC AATGGCCGTT ATATTAAGAA CTTCCTGCTC AATCGTGCTA 60 60 

TTTTGGATGG TTTTGGAAGC AAGCTTATGG TTGGACGTTT TCCACTGGCT GTCATTCACA 612 0 

TCCATATCGA CCCTTATCTA GCGGATGTCA ATGTGCATCC AACTAAGCAA GAGGTGCGGA 618 0 

TTTCCAAGGA AAAAGAACTG ATGACTCTGG TTTCAGAAGC TATTGCAAAT AGTCTCAAGG 6240 

AACAAACCTT GATTCCAGAT GCCTTGGAAA ATCTTGCCAA ATCGACCGTG CGCAATCGTG 63 00 

AGAAGGTGGA GCAAACTATT CTCCCACTCA AAGAAAATAC G C T CT AC T AT GAGAAAACTG 63 6 0 

AGCCGTCAAG ACCTAGTCAA ACTGAAGTAG CTGATTATCA GGTAGAATTG ACTGATGAAG 642 0 

GGCAGGATTT GACCCTGTTT GCCAAGGAAA CCTTGGACCG ATTGACCAAG CCAGCAAAAC 64 80 

TG C ATTTTGC AGAGAGAAAG CCTGCTAACT ACGACCAGCT AGACCATCCA GAGTTAGATC 6 540 

TTGCTAGCAT CGATAAGGCT TATGACAAAC TGGAGCGAGA AG AAGC AT C C AGCTTCCCAG 660 0 

AGTTGGAGTT TTTCGGACAA ATGCACGGGA CTTATCTCTT TGCCCAAGGG CG AG AT GG AC 6 6 60 

TTTACATCAT AGATCAGCAC GCTGCTCAGG AACGGGTCAA GTACGAGGAG TACCGTGAAA 672 0 

GCATTGGCAA TGTTGACCAA AGCCAGCAGC AACTCCTAGT GC C C TAT AT C TTTGAATTTC 67 8 0 

CTGCGGATGA TGCCCTGCGT CTCAAGGAAA GAATGCCTCT CTTAGAGGAA GTGGGCGTCT 684 0 

TTCTAGCAGA GTACGGAGAA AATCAATTTA TTCTACGTGA ACATCCTATT TGGATGGCAG 6900 

AAGAAGAGAT TGAATCAGGC ATCTATGAGA TGTGCGACAT GCTCCTTTTG ACCAAGGAAG 69 60 

TTTCTATCAA GAAATACCGA GCAGAGCTGG CTATCATGAT GTCTTGCAAG CG AT CT AT C A 7020 

AGGCCAATCA TCGTATTGAT GATCATTCAG CTAGACAACT C C T CT AT C AG CTTTCTCAAT 7 08 0 

GTGACAATCC CTATAACTGT CCTCACGGAC GTCCTGTTTT GGTGCATTTT ACCAAGTCGG 714 0 
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ATATGGAAAA GATGTTCCGA CGTATTCAGG AAAAT C AC AC CAGTCTCCGT GAGTTGGGGA 72 00 

AATATTAAAA GTATAAAAAA GTCTGGGAAA AATTTTCAAA ATCAAAAAAA CG CAT AAAAT 72 60 

CAGGTGTTCA AAAACCTTGA TTTTATGCGT TTTATCATGG AAATAGTTAC TTCATTTTTT 732 0 

CCTAATTCTT TTCGAAACTC TTTTTAAACG ACGTCAGTTT TATCAGTAAT CTCAAAACAG 73 80 

TGTTTTGAGC TAATTTTGCC AGTTTTGTCT GTAACATCGA AGTTGTGTTT TACCACTCTG 7440 

CGACTGGTTT CCTAGTTTGC TCTATGATTT T C AC AG AG C A TTAAATTGCG ATTTTGCCAA 7500 

GTTTCTTTAT TCGTCTAAAA GTAGAGTCTG TTCTATGCGT CTAATGTACG AATCAGGTTG 7 5 60 

ACCATTTCAA TAGCTCCTTG TGCACACTCA GAACCCTTAT TTCCTGCTTT AGTACCAGCT 7 620 

CGTTCTATGG CTTGTTCAAT TGTATCTGTC GTTAGCACAC CAAACATAAC AGGAATTTCG 7 680 

CTATTTAAAC TGATTTGGGC GATTCCCTTA GATACCTCGC TACATACATA AT C AT AAT G A 774 0 

CTTGTATTCC CTCTAATGAC AGCTCCCAAG CAGATAATTG C AT C AT ATTT TTTACTTTTT 7 800 

GCCATTTTTG ATGCAATCAG TGGTATTTCA AAAGCTCCTG GAACCCAGGC TACCTCTATA 7 860 

TCTTTCTCGT TTACATTCTC TCTTTTGAGA TTATCTAGTG CTCCAGATAA TAATTTTGAA 7 92 0 

GTTATAAATT CAT T AAAT C T CGCTACAACA ATACCTATTT TAATATTGTT TGC TACT AAA 7980 

TTACCTTCAT AAGTGTTCAT TTATTTTTCC TCCATATTTA AAATGTGACC CATTCGATTT 804 0 

TTCTTTGTTT CTAAATAAAA ACTATCGTAA GGATTGGCTT CTATTTCGAT T GAT ATT C T A 8100 

CTGGAAATGG T AAT T C CAT A TTTTTCTAAC TGTTCAACCT TGTCAGGATT ATTTGTCAGT 8160 

AAATGAAGTG ACTGAAGTCC CAGATCTTTA AGCATTTTTG CTCCAATATG ATATTCTCTT 822 0 

AAATCACCTT CAAAGCCTAA TGCAAGATTG GCATCAAGCG TATCCATGCC TTGATCTTGT 82 80 

AAATGATAGG CTTTTAATTT ATTGATAAGT CCAATTCCTC GTCCCTCCTG TCGCAAGTAA 83 40 

AGTAAGACAC CCGAACCATT CTCAACAATC ATTTTCATAG CCTTATCGAA TTGCTGTCCA 8400 

CAATCGCAAC GTAAAGAGCC TAAAACATCT CCTGTTAAAC ATTCGGAGTG G AC C CG AC AT 84 60 

AAT AC AT TGG CTTCATCCTC TATATTTCCC ATAATAAGAG CAAGATGATG TTCCCCATTT 852 0 

AGTTTATCTA TATAG CTAAT TGCTTTGAAA TTACCGTATC TAGTAGGCAT ATTGACAGTT 8580 

GAAACTCGTT CTACCAGCTG ATCATATACT TTTCTATATT CTTGTAATTC TTTG AT GGT A 8 640 

ATTAGTGGAA TGTTGTGTTT TTTCGAGAAC TG AAT T AAAT CATCTGTTCT CATC ATT TTG 8700 

CCATCATGAT TCATTATTTC ACAACATAGG CCACACTCTT TTAGTCCAGC TAATTTTAAT 87 60 

AAAT C AAC AG TTGCTTCTGT GTGTCCATTT CTTTCTAGGA CACCACCTTT TTTTGCAATT 8820 

AAAGGAAACA TGTGTCCTGG CCTGCGAAAA TCAGAGGGTG TTATATCTTC AG CT AC AC AC 8880 

ATACGTGCGG TCAGTCCTCT TTCCTCGGCA GAAATACCTG TGGTCGTTTC TTTATAATCA 8 94 0 
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ATTGAAACTG TAAAAGCAGT C T T AT G ATT A TCTGTATTGT TTTCAACCAT AGGTGAAAGC 9000 

ATTAATTGAT TAGCTAAACT TTCGCTCATA GGCATACAAA TTAATCCTTT GGCATAAGTA 90 6 0 

GCCATAAAAT TAACATTTTC TGTTGTAGCT GCTTGTGCAG AACAAATTAA GTCTCCTTCA 9120 

TTTTCTCTAT CCTTGTCGTC TATAACAAGA ACAAGTCGTC CCTTCTGCAA TGCTTCTAAT 9180 

GCTTCTTGTA TTTTTCGATA TTCCATTGAC TGATTATCCT TTCTGCTAAA ATCCATTTTG 924 0 

ATATAATAGT TCCTTAGATA TTTCTGATTT TGGAGAGTTA TCCATCAGTT TTTGCACATA 9300 

TTTACCTAAG ATATCATTTT CAAGATTTAC TGTACTCCCG ACTTGTTTAC TCTTAAGAAT 93 60 

GGTTTGTTCC AAGGTATGAG GGATAACAGA TACTGAAAAG TTTACTTTGG AGACTTTAGC 9420 

GACAGTCAGA CTAATGCCGT CAATTGTAAT AGATCCTTTT TCAACTATTA AATCTAAAAT 9480 

TTCTTTTTGT GTGTTGATTT GATACCATAC AGCATTATCA TCTTTTTTTA TTGACGAGAT 9540 

TTTTCCTGTA CCATCAATGT GTCCTGTAAC GACGTGACCC CCAAGTCGAC CGTTGACAGA 9 600 

TAAGGCTCTT TCT AG AT T C A CCTCACTTCC ATGTTTTAAT AGAGTAAGAG CTGTTCGACT 9660 

CCATGTTTCA TTCATTACAT CAACTGTAAA GGATTGATGA TTGAAATGAG TAACTGTAAG 9 72 0 

ACAGATACCA TTTACTGCTA TACTATCGCC TAAATGGATA TCCGTTAATA TTTTTGAGGC 97 80 

TTTAATTGAT AGTTTACAAT TACGAGAGTC TTTCTGTATT CTTTCAACTT TTCCGATTTC 984 0 

TTCAATTATT CCTGTGAACA TGGATAAATC ACTTCACTTT CTATGAGATA GTCATTTCCT 9 900 

AT TTG AG AAA ATGCATAAGG TTTCAATCTA ATAGCGTCAT TTGGCAAAGA AATACCTTCA 99 60 

CCTCCGACAG GAAACTTGGC ACTACCTCCA AAAACTTTTG GTGCAATATA TATTTTCAGC 10020 

TCATCAACAA TTTGTTGTTC CAAAGCACTC CAATTCATTA GACTGCCCCC TTCTAGAACT 1008 0 

AGGCTATCAA TCTGCATGTT TCCTAGATGT TGCATTAAAC TCGATAAGTC TATATGATTG 1014 0 

CCTTTTTTCT TTATGGAAAG TATTTCACAG CCATGATTTT GATATAGCTT CATTTTATTT 102 00 

TTGTCTTCAG AGGAAGTGGC AATGTAAGTT TTAATATCAT TTGCTGTTTT TACGATTTTA 10260 

GAGGTAAGAG GAGTTCGTAA ATGTGTATCG CATATGATAC GGATAGGATT TTTCCCTTCC 10320 

TCCAATCTAC ATGTCAGCAA AGGATCGTCT TGAATAACAG T ATTG ACT C C CACCATAATT 103 8 0 

GCACTAACAT GGTGTCGTAA CTGATGCACA TGCTTTCTTG CTTCTTCTTC AGTAATCCAT 10440 

TTGGATTGAT TTGTTTTAGT GGCTATTTTT C CAT CC ATTG AC AT TG CAT A TTTCATAAAA 10500 

ACATAGGGTA CATGCTGGGT AATATACTTT CTAAAACTTT TTATTAAGTT AAG AC ACT C A 10560 

TTTTCTAAAA TTCCAACAGT AACTTGAAGA TTATTTTCCT CAAGTATCTT TACTCCTTTT 10 620 

CCAGATACAA TAGGATTACA GTCTAGGCTT CCAATGACTA CTCTTGTAAT ACCACTATCG 10680 
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AT TAT AG CAT CTATACAGGG AGGTGTTTTC CCGAAGTGAC AACAGGGTTC AAGTGTTACA 10740 

TAAAGCGTCG CTCCGACAGG GGATTCTCTA CAGTTTTTAA GAGCATTTCT CTCAGCATGT 10 8 00 

GGGCCACCAA AAAACTCATG ATAACCTTGT CCGATAATGT GAT T AT CTTT TACAATAACT 108 60 

GCGCCGACCA TAGGATTGGG ATTGACGTAA CCAGCCCCTT TTTGTGCCAG TTTTATTGCT 10 92 0 

AATTTCATAT ATTTTGAATC GCTCATCTCG CTACCTCCAA AAAAATATAC CTTGAATAGG 10980 

GGACTACTCA AGGCATACAA AAGAAAACTT ATGCGATTAA CAAAAATGCT CTGAAATGAC 1104 0 

AAGTAATCAT TTCAGAGCAC GCAAAAAGCA CAAATATACT TTTATCTTCT TTCATCCAGA 11100 

CTATACTGTC GGCTTTGGAA TTTCACCAAA TCATGCCTTT CGGCTCGTGG GCTATACCAC 11160 

CGGTAGGGAA TTTCACCCTG CCCTGAAGAT AGTTATTCAA TTACAGATGA TTATAGTACT 1122 0 

TAATTTTGAA TATGTCAACA GATAAATACC GATTGTTTTT GATATACTGT ATTTGTGATA 112 80 

ATCGATTCTC GCTCCTCGGA TAAAGAAAAT ATGATATACT AG AT AAAC G A AATAAGAGAG 113 4 0 

AAGGAATACT ATGTACGCAT ATTTAAAAGG AATCATTACC AAAATTACTG C C AAAT AC AT 11400 

TGTTCTTGAA ACCAATGGTA TTGGT TAT AT CCTGCATGTG GCCAATCCTT ATGCCTATTC 114 60 

AGGTCAGGTT AATCAGGAGG CTCAGATTTA TGTGCATCAG GTTGTGCGTG AGGACGCCCA 11520 

TTTGCTTTAT GGATTTCGCT CAGAGGATGA GAAAAAGCTC TTTCTTAGTC TGATTTCGGT 115 8 0 

CTCTGGGATT GGTCCTGTAT CAGCTCTTGC TATTATCGCT GCTGATGACA ATGCTGGCTT 11640 

GGTTCAAGCC ATTGAAACCA AGAACATCAC CT AC T T G AC C AAGTTCCCTA AAATTGGCAA 117 00 

GAAAACAGCC CAGCAGATGG TGCTGGACTT GGAAGGCAAG GTAGTAGTTG CAGGAGATGA 117 60 

CCTTCCTGCC AAGGTCGCAG TGCAAGCAAG TGCTGAAAAC CAAGAATTGG AAGAAGCTAT 1182 0 

GGAAGCCATG TTGGCTCTGG GCTACAAGGC AACAGAGCTC AAGAAAATCA AGAAATTCTT 11880 

TGAAGGAACG ACAGATACAG CTGAGAAC T A TATCAAGTCG GCCCTTAAAA TGTTGGTCAA 11940 

ATAGGAGCAG AGAATGACAA AACGTTGTTC GTGGGTCAAG ATGACCAACC CGCTCTACAT 12 000 

CGCCTATCAT GATGAGGAGT GGGGCCAGCC CCTCCATGAT GACCAAGTAT TGTTTGAGTT 12060 

GTTGTGTATG GAAACCTATC AGGCAGGCCT GTCTTGGGAA ACGGTACTCA ACAAACGCCA l2120 

AGCTTTCCGA GAAGTCTTTC ATAGCTATCA AATTCACTCA GTCGCAGAGA TGACTGACAC 12180 

TGAATTGGAA GCCATGCTGG AGAATCCAGC TATCATTCGA AATAGAGCCA AGCTTTTTGC 12240 

TACACGCGCT AACGCCCAAG CCTTTCTACA GTTACAGGCA GAGTACGGCT CTTTTGATGC 12 3 00 

CTATCTTTGG TCTTTTGTTG AGGGGAAAAC TGTCGTTAAC GATGTTCCTG ATTATCGCCA 123 60 

AGCGCCAGCT AAAAC AC CCT TATCTGAGAA ATTAGCCAAA GATCTCAAAA AACGAGGCTT 12420 

CAAGTTCACA GGCCCAGTCG CCGTATTGTC TTTTCTACAG GCTGCAGGGC T AGTTG AT G A 124 80 
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CCACGAGAAT GATTGTGAGT GGAAAGGTCT TAAATGATGT CTAACAAAAA TAAGGAAATT 12 540 

CTGATTTTTG CGATTCTCTA TACAGTCCTC TTTATGTTTG ATGGCGTTAA ATTGCTGGCT 12 6 00 

TCTTTAATGC CATCTGCCAT TGCAAATTAT CTTGTTTATG TAGTTTTAGC TCTATATGGC 12 660 

TCCTTCTTGT T CAAGG AT AG ATTGATCCAA CAATGGAAGG AGATTAGAAA G AC T AAAAG A 12720 

AAATTCTTCT TTGGAGTCTT AACAGGATGG CTCTTTCTCA TTCTGATGAC TGTTGTCTTT 127 80 

GAATTTGTAT CAGAGATGTT GAAGCAGTTT GTGGGACTAG ATGGACAAGG TCTAAATCAG 12 840 

TCTAATATTC AAAGTACCTT TCAAGAACAA CCACTACTGA TAGCTGTTTT TGCTTGTGTC 12 900 

ATTGGACCTC TGGTAGAAGA ATTATTTTTC CGTCAGGTCT TATTGCATTA CTTGCAGGAA 12 960 

CGGTTGTCAG GTTTACTAAG CAT T AT TCTG GTAGGACTTG TTTTTGCTCT GACTCATATG 13 020 

CACAGTTTGG CTCTATCAGA GTGGATTGGT GCAGTTGGTT ACTTAGGTGG AGGCCTTGCC 13 080 

TTTTCTATTA TTTATGTGAA AGAAAAAGAG AAT ATC T ACT ATCCCCTACT TGTT C AC AT G 1314 0 

TTAAGCAACA GCCTCTCCTT AATCATTTTA GCTATCAGTA T AG T AAAATG AAATGAGAAC 13 200 

AGGACAAATC GATTTCTAAC AATGTTTTAG AAGTAGAGGT GTACTATTCT AGTTTCAATA 13 2 60 

TACTGTAATA TGTGATGAAA ATGCCAGTAA TGATACCGAG AAAAAAGCTG AGAAACTTTT 13 320 

CCCAGCTTTA TTTGTTATAG TCAAAGAGAA TGACTTGTTC CTGTGCATCT ACATGAGCAT 133 80 

GG AC C C C AAA GGGTACAATT GCTCTTGGAG TTGCGTGGCC G AC ATT C AG A T T AT AG AC AA 13 44 0 

TCGGGATATT GCTGTCAATG ATATCCAATA GTGCCTCTTT ATAGTCGTCA TGGAAAGTTT 13 500 

CATCCATAGG TTTTCCGACC AAGAGTCCAT TGATGACCGC GAATATGCCA GTGTCCTTTA 13 5 60 

AAGTTAGCAA CATCTTTTTG AAGTCTTCTG GCTTAGGCTT TTCTTCGCTT GTTTCGAGCA 13 62 0 

AG AGG AT T TT CCCTTCCCAG TCTGACAAGT CAGGGAAAAG TTTGTATTTT TGGCAGAGTT 13 680 

CCGTGCTATC TGCGTATCGA GAGTTGTCAA AGATATCGTA GAGGGATTCG AGGCAACCAC 13 74 0 

CGAGGATTTT CCCCTCGAAC TGGGCACTTC CTTGCAACAA GTCAAAACCT GTATTTGTAT 13 8 00 

GACTGACACG AGGTGTTCCC AGGGCCGTGG GACTAAAATC AGTTCGTTCC TCATACCAAA 13 8 60 

CGTCACTAGG GCGGATTTCT GAAATTCTTC CCGTCTCAAT CAATTCTTTA AAGTAGTGAA 13 920 

GGCTATAGGC TAGCATTTCT TTGTCTAATT CACAAATGTC TGCTAAAAAG GATTGACCAT 13 980 

AAAAAGTCTT GAT T C CT AAT TTATGCAACA TGAGGTGGTT CATGGTTGTA TCCGAGAAGC 14040 

CAAGAAAAAT TTTTTGCTTG ATAACCTTTT GGAGTTGGTC ATTTTCAAAA AGATAAGGTA 14100 

GCAAGCGATA GGTATCGTCT CCACCGATGG CACATAGGAT CATGTCGATG CT AT CAT C AG 14160 

AAAAGGCATG AATCAAATCC TCTGCACGAG CTTCAGGATG GTCCTTGATA AAGTCTAATC 1422 0 
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CTTTTAACGA ATGGGGCAAA AAGATGGGAT TGGTCCCAGA TCCTTGAGAC GTT 142 73 

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9828 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

GTGAAGTGCG GCAAAAGGTG CAAGTGATGA GCTCAGGTTC TTTAGCTCTT GACATTGCCC 60 

TTGGCTCAGG TGGTTATCCT AAGGGACGTA TCATCGAAAT CTATGGCCCA GAGTCATCTG 12 0 

GTAAGACAAC GGTTGCCCTT CATGCAGTTG CACAAGCGCA AAAAGAAGGT GGGATTGCTG 180 

CCTTTATCGA TGCGGAACAT GCCCTTGATC CAGCTTATGC TGCGGCCCTT GGTGTCAATA 240 

TTGACGAATT GCTCTTGTCT CAACCAGACT C AGG AG AG C A AGGTCTTGAG ATTGCGGGAA 3 00 

AATTGATTGA CTCAGGTGCA GTTGATCTTG TCGTAGTCGA CTCAGTTGCT GCCCTTGTTC 3 60 

CTCGTGCGGA AATTGATGGA G AT AT CGG AG ATAGCCATGT TGGTTTGCAG GCTCGTATGA 42 0 

TGAGCCAGGC CATGCGTAAA CTTGGCGCCT CTATCAATAA AACCAAAACA ATTGCCATTT 4 80 

TTATCAACCA ATTGCGTGAA AAAGTTGGAG TGATGTTTGG AAATCCAGAA ACAACACCGG 540 

GCGGACGTGC TTTGAAATTC TATGCTTCAG TCCGCTTGGA TGTTCGTGGT AATACACAAA 600 

TTAAGGGAAC TGGTGACCAA AAAGAAACCA ATGTCGGTAA AGAAACTAAG ATTAAGGTTG 6 60 

TAAAAAATAA GGTAGCTCCA CCGTTTAAGG AAGCCGTAGT TGAAATTATG TACGGAGAAG 72 0 

GAATTTCTAA GACTGGTGAG CTTTTGAAGA TTGCAAGCGA TTTGGATATT ATCAAAAAAG 7 80 

CAGGGGCTTG GTATTCTTAC AAAGATGAAA AAATTGGGCA AGGTTCTGAG AATGCTAAGA 840 

AATACTTGGC AGAGCACCCA GAAATCTTTG ATGAAATTGA TAAGCAAGTC CGTTCTAAAT 900 

TTGGCTTGAT TGATGGAGAA GAAGTTTCAG AACAAGATAC TGAAAACAAA AAAGATGAGC 9 60 

CAAAGAAAGA AGAAGCAGTG AATGAAGAAG TTCCGCTTGA CTTAGGCGAT GAACTTGAAA 1020 

TCGAAAT T G A AGAATAAGCT GTT AAAG C AG TGGAGAAATC CGCTACTTTT TCGATTTTTG 1080 

ATTCAAGTTT TTAGATTATA TATAGTAGCT TGAAATAAGA TATGAACAAC TCTATTAGGA 114 0 

AAGTCAAATT AATTTCTAGA AATGTTTTAG CAGCTACAGC GTACTATTCC AAACTCAACC 12 00 

AACTATAATA GATCGAAACT AGAATAGTAC ATATCTACTT CTAAAACATT GTT AAAAAT C 12 60 

GATTTGACTT TCCTTATTTC ATTCCGCTAT AT AT AGTTT G CTGTTTCTTG TCGCTCCTCT 132 0 

GGAAAGCTGA TATAATAGCT TTATGAATAA AAAACGAACA GTGGACCTGA TACATGGTCC 13 80 
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G ATT C TT CC C TCGCTCTTAA GCTTCACCTT TCCAATTTTG CTATCAAATA TTTTTCAACA 1440 

GCTCTATAAC ACTGCTGATG TCTTGATTGT TGGACGATTT CTTGGTCAAG AATCCTTGGC 1500 

TGCAGTAGGA GCGACGACAG CGATTTTTGA CCTGATTGTA GGTTTTACAC TTGGTGTTGG 15 60 

CAATGGCATG GGGATTGTCA TTGCTCGTTA TTATGGGGCT CGGAATTTCA CTAAAATCAA 162 0 

GGAAGCAGTA GCAGCCACCT GGATTTTAGG TGCTCTTTTG AGC ATT C TAG TTATGTTGCT 1680 

GGGCTTTCTT GGCTTGTATC CTCTCTTGCA ATACTTAGAT ACTCCTGCAG AAATTCTTCC 1740 

TCAATCTTAT CAATATATTT CTATGATTGT GACCTGTGTA GGTGTCAGCT TTGCTTATAA 1800 

TCTTTTTGCA GGCTTGTTGC GGTCTATTGG TGACAGTCTA GCAGCCCTGG GATTTCTGAT 1860 

TTTCTCTGCC TTGGTTAATG TGGTTCTGGA TCTCTATTTT ATTACGCAAT TGCATCTGGG 192 0 

AGTTCAATCC GCAGGACTTG CTACCATTAT TTCGCAAGGT TTATCAGCGG TTCTCTGCTT 1980 

TTATTATATT CGTAAAAGTG TGCCAGAACT CTTGCCACAG TTTAAACATT TCAAATGGGA 2 04 0 

CAAAAGCTTG TACGCGGATC TCTTGGAGCA AGGTTTGGCT ATGGGCTTGA TGAGTTCAAT 2100 

TGTATCTATC GGCAGTGTGA TTTTACAGTT TTCTGTTAAT ACATTTGGTG C AGTG AT TAT 2160 

TAGTGCCCAG ACGGCAGCTC GACGCATTAT GACCTTTGCC CTTCTTCCTA TG AC C G CT AT 2 22 0 

TTCTGCATCA ATGACGACCT TTGCTTCTCA GAATCTAGGA GCTAAGCGAC CTGACCGTAT 2 2 80 

TGTTCAAGGT CTTCGAATCG GCAGTCGTTT AAGTAT AT C C TGGGCAGTTT TTGTTTGTAT 2340 

TTTCCTCTTT TTTGCCAGTC CAGCTTTGGT TTCCTTCTTG GCTAGTTCGA CAGATGGTTA 240 0 

CTTGATAGAA AATGGAAGTC TCTATCTGCA AATCAGTTCA ACCTTTTATC CCATTTTGAG 24 60 

CCTCTTGTTG ATTTATCGCA ATTGCTTGCA GGGCTTGGGG CAAAAGATCC TTCCTCTAGT 2 52 0 

TTCTAGCTTT ATTGAACTAA TCGGAAAAAT CGTTTTTGTG GTTTTGATTA TTCCTTGGGC 2 580 

AGGATATAAG GGTGTTATCC TTTGTGAACC TCTTATCTGG GTTGCCATGA CAGTTCAACT 2 64 0 

GTACTTCTCA TTATTCCGTC ATCCCTTGAT AAAAGAAGGC AAGGCAATCT TGGCAACCAA 2 7 00 

AGTGCAATCC TAGTTGGATT TACTGAATAA AATCCATTTC CTCTAGTGAA AATCGAAAAA 2760 

ACTTGTGTTC TCTTCTTTAG TTTGGTGTTG AAAATAGTTT AACAGACTTT TGACTTCTTT 2 82 0 

TATATGATAT AATAAAGTAT AGTATTTATG AAAAGGACAT ATAGAGACTG TAAAAATATA 2880 

CTTTTGAAAA TCTTTTTAGT CTGGGGTGTT ATTGTAGATA GAATGCAGAC CTTGTCAGTC 2940 

CTATTTACAG TGTCAAAATA GTGCGTTTTG AAGTTCTATC TACAAGCCTA ATCGTGACTA 3000 

AGATTGTCTT CTTTGTAAGG TAGAAATAAA GGAGTTTCTG GTTCTGGATT GTAAAAAATG 3 060 

AGTTGTTTTA ATTGATAAGG AGTAGAATAT GGAAATTAAT GTGAGTAAAT TAAGAACAGA 312 0 



WO 98/18931 



PCTYUS97/19588 



400 

TTTGCCTCAA GTCGGCGTGC AACCATATAG GCAAGTACAC GCACACTCAA CTGGGAATCC 3180 

GCATTCAACC GTACAGAATG AAGCGGATTA TCACTGGCGG AAAGACCCAG AATTAGGTTT 3240 

TTTCTCGCAC ATTGTTGGGA AC GGT TGC AT CATGCAGGTA GGACCTGTTG AT AATGGT GC 33 00 

CTGGGACGTT GGGGGCGGTT GGAATGCTGA GACCTATGCA GCGGTTGAAC TGATTGAAAG 3 3 60 

CCATTCAACC AAAGAAGAGT TCATGACGGA CTACCGCCTT TATATCGAAC TCTTACGCAA 342 0 

TCTAGCAGAT GAAGCAGGTT TGCCGAAAAC G CTTG AT AC A GGGAGTTTAG CTGGAATTAA 3 4 80 

AACGCACGAG TATTGCACGA AT AAC CAACC AAACAACCAC TCAGACCACG TTGACCCTTA 3 54 0 

TCCATATCTT GCTAAATGGG GCATTAGCCG TGAGCAGTTT AAGCATGATA TTGAGAACGG 3 600 

CTTGACGATT GAAACAGGCT GGCAGAAGAA TGACACTGGC TACTGGTACG TACATTCAGA 3 6 60 

CGGCTCTTAT CCAAAAGACA AGTTTGAGAA AATCAATGGC ACTTGGTACT ACTTTGACAG 3720 

TTCAGGCTAT ATGCTTGCAG ACCGCTGGAG GAAGCACACA GACGGCAACT GGTACTGGTT 3780 

C G AC AAC T C A GGCGAAATGG CTACAGGCTG GAAGAAAATC GCTGATAAGT GGTACTATTT 3840 

CAACGAAGAA GGTGCCATGA AGACAGGCTG GGTCAAGTAC AAGGACACTT GGT ACT AC T T 3 9 00 

AGACGCTAAA GAAGGCGCCA TGGTATCAAA TGCCTTTATC CAGTCAGCGG ACGGAACAGG 3 9 60 

CTGGTACTAC CTCAAACCAG ACGGAACACT GGCAGACAAG CCAGAATTCA CAGTAGAGCC 4 02 0 

AGATGGCTTG ATTACAGTAA AATAATAATG GAATGTCTTT CAAATCAGAA CAGCGCATAT 4080 

TATTAGGTCT TGAAAAAGCT TAATAGTATG CGTTTTCTTG T GG AG AT AT T TCCTTCAATT 4140 

TTGCTACTAT ATTAAACAAA AATCAAAAAG CAAACTAGAA AGTTATGCTC AAATAAAATC 42 00 

TAAATTTGAC AATGTAAACC GAGTCGGATA GCTTTAAGTA CTGTTTTGAG GTTGAAGATA 42 60 

CGATTTTTGA TAGGAACTCA TCAATTTTAG ATTTTTAAGC AGCATCAATA AATTGCTTCC 4320 

TTGTTTTGTC ATAATTTTTT TATTTAAAAA ATTATGACma GAGTGTGCTA TTCTTTTTAT 43 80 

GAGAGGTGTA TGAATATGAT AAATGTATGT GATAAATGTA TGTGATGTTG GAAAAAGAAT 444 0 

AAAAGAACTT AGAATATCTT CAAATCTTAC TCAAGATAAG ATTGCTGAGT ATTTGTCTTT 4500 

GAATCAAAGC ATGATTGCCA AAATGGAAAA AGGTGAAAGG AAT AT C AC G A ATGGATTTAA 4 560 

GTAATAAAGC TTCAAATCTT AGAAAAAAGT TGGGAGCTGA TGGTGAATCG CCGATAGATA 4 620 

TTTTTAAATT GGTACAAAAG ATAGAAAATT TGACGCTGGT ATTTTATGGA CTCGGAAAGA 4 680 

ATATTAGCGG AGTCTGTTAT AAAGGAACTC AGTTCAGTCT CATTGCAGTC AATTCAGACA 474 0 

TGCCATTAGG AAGGTAAAGA TTTTCTTTAG CACATGGACT GTATCATCTT TATTATGATG 4 800 

AGGTGAAGAA GAGTTCAGTC AGTCTTATCT TGATTGGTGA AGGAGATGAA ACTGAAAGAA 48 60 

AAGCGGATCA GTTTGCTTCT TATTTTTTAA TTTTCCCATC TTCACTGTAT AGGATGGTTG 4 92 0 



WO 98/18931 



PCT7US97/19588 



401 

AGGAAATCAG AGAAAATGCC AATAGAACTC ATCTTGAAGT AGAAGATATT ATAAAATTGG 4980 

GTCAGTTTTA TGGTATCAGT CATAAAGCTA TGTTATATAG ATTGAGGAAT GATGGATACC 5040 

TTGATGCAGA AGAAATTAAA AATATGGATA TTAGTGTTAT AG AG AC AG CT TCAAGATTAG 5100 

GCTATGATAC AAGTTTATAT CGTCCTTTGT CAGAAAGTAA AAAAGAAATG GCATTAGGAT 5160 

AATATATTAA TTCAACTGAA CAACTTTTAG AAAATAACAG AATTTCGCAA GGGAAGTATG 5220 

AGGAACTGTT ACT AG ATGC T TTCAGATATG AT AT TGT AT A TGGGCTAGAT GAAGAGGGGG 52 80 

GAGTTGTCGT TTGACTAGTC GTGTATTTAT TGATGCAGAT TGTATTTCAG TATTTTTATG 5340 

GGTTGGCACT GAACATCTTT TAGAAAAGCT CTATTTGGGT AAAATTGTTA TTCCACAAGA 5400 

GGTGTATGAT GAAATCAATA TACCTACAAT TCCCCATTTA AAATCTAGGA TAGATCAGTT 54 60 

GGTAGCTAAG GGTTCAGCTG AGATTGTGAG CATAGACATT GGAACTGAAG AATACGCATT 5 52 0 

ATATAGAGAT TTAACAAGAA AT CAT GAT AG TAACAAGATT ATTGGTAAGG GAGAAGGGGC 5 58 0 

ATCTATTTCC TTAGCGAAAA AGCATAATGG GATATTAGGA AGTAATAACC TAAGAGATGT 5 64 0 

TAAATCATAT GTAGAAGAAT TTTCTTTAGA ATATATGACA ACAGGAGATA TACTGATTGA 5700 

AGCGTTTAAA GCGTAATTTA TTACTGAATA AGAGGGCAAT CATATCTGGA ATAATATGCT 57 60 

TAAAAAGAGA AGGAAAATTG GTGCAAATTC ATTTTCAGAC TATCTTCGTG GAAGTATTCA 5820 

TCAAAATAGA CAAAAATAAA TTTGGATAAA TCGAACTCAC TATTCAGGAG GCATATGAGC 5 880 

AATTCGAAAA AG AAAAGT GT CAAATTGAGC CTATAGGAGT AGAAGTGAAA TAGTAAGTCC 5940 

TGCATAGTGG ATGAGAGAAA AGTTCTCCTT GAAGTTTTCC TGAACTATCA GTCGCATGTC 6000 

AAACG AT AT G TAGGGTAATG TGAGAGGGGA TAGCGAGTAG TTTTTGGTTA TTTTATCAAA 6 0 60 

AAAC TT AT AT TTTATTATAC CGAATGATAA AATATAATAA AAATGATAGA ATAAGGAAAA 612 0 

AACATGAATG TCAAAAAGAT AATGTCAATT TTTCAATCCT TTTATGTTGA TGTCAGTATT 618 0 

GAGGAACTGA CTTTGACTTT ACCAATCAGT TTTGTAAAAA GGTTTGAGTA TACTCAAATG 62 4 0 

ACTTTTCATA AGGAATCATT TTTATTGATT AAAGAAAAGA GAAGGGGGAG TTTGAGTTCA 63 00 

TTTGTTACTC AGGCTCGCAC TATGGGTGAA AAAGCCAATA TGGATGTTGT TTTGGTGTTT 63 60 

TCGAAGTTAT CAGACAGTGA AAAAAAGCAA TTACTTCAAG CTAGAGTTCC GTTTGTAGAC 6420 

TTTAAGGGAA ACCTCTTCTT CCCTCCATTG GGACTAGTAC TCAATGCGAA TGATACTGAA 6480 

GTCCCTAAGG AATTAAC AC C TAGCGAACAA TTAACGTGGA TTGCCTTTTT ATTGACAAAA 654 0 

GGTCAAAAAG TAGTAGATGT TGATTTGCTT TCACAAGTCA CTGGACTTCC AAACTCAACA 6 6 00 

ATTTATAGGT GTTTGAGGAC TTTTAAAGCT TTATATTGGT TAAACAAGCA AAATAAGCTT 6660 
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TACACATATA CGGTGTCAAA GAAAGAATTA TTCTTAAAAT CCGTGTCATG TTTATTTAAT 672 0 

CCCATCAAAA AACGGATTTT ATTGCCAGAT GGCGATATAA AG C AG AT AAA ATCTGTTTCT 6780 

AACCTTCTAT ATGGTGGTGC TTATGCTTTG TCGCATTCAA CTTTTTTAGC TGAAACGGAT 6 84 0 

G AAAAT AT T A GCTATGTCAT ATGGCAGAGA AAATTCAATC AGTTATCCTT GCCACTTTCT 6900 

CAGCATGTTT TAAAATGAAA GAT GCTAGAG ATATGGAAAT ATCGTCCTTT TGTATCTGAG 6960 

TTTTGGAATG ATTTTAAAAA TAATCATGAT AAACAATTTG TAGATCCGAT TTCTCTTTAT 7 02 0 

TTGACCTTAA AAGATGATGA TGACCCACGT ATAGAGGAAG AGAGTGAAGC AC T AGAAAAT 7 080 

ATGATATTAC AGTATCTGGG AGAAGATGAT GCCAGCTAAT ACGAAAGTTA TTTTTCAAGA 714 0 

AATGTTTGCG GATTTTCAGA ACTATTATGT TCTGATTGGG GGAACTGCTA CCTCTATCGT 7 200 

ATTGGATTCG CAAGGATTTA AAAGTCGCAC AACAAAAGAT TATGATATGG TCATCATTGA 72 60 

TGAAGTAAAA AATAAGGAAT TTTATACTAC CTTGAATCAT TTTTTAGAAT TGGGAGAGTA 7320 

TCAAGGAAGT CAGAAAGATG AGAAAGCGCA GCTTTTTCGA TTTACAACAA CTAATCCTGA 73 8 0 

GTTTCCTTCT ATGATTGAAC TATTTAGTAT CTTACCAGAA TATCCATTAA AG AAGG AC GG 7440 

TCGAGAAATT CCCTTACATT TTGACCAAGA TGCTAGTTTA TCAGCCTTAT TATTGGATGA 7 5 00 

AGATTATTAT AAT AT AT TGG TGCATGAAAA AGAAACCATT CAGGGGTATT CGGTATTGAG 7 5 60 

TAATTGTGGT TTATACTCTT CGAAAATCTC TTCAAACCAC GTCAGCTTCC ATCTACAACC 7 62 0 

TCAAAACAGT GTTTTGAGCA GCCTGCAGCT AGCTTCCTAG TTTGCTCTTT GATTTTCATT 7680 

GAGTATTAAT TATTTTTAAG GCTAAAGCTT GGCTGGATAT GAGGGAGCGC TCTGCCACAG 7 74 0 

GTGCTCAAGG TTTAAGTAAG TCCATTAAAA AGCATTTGAA TG AC C T T AC C CGTTTGACAG 7 800 

CTTCCTTGCT AGGAGATGAA AAGTTATCGG CTATAACATC AAGTAGTGCG GTAAAAGCAG 7 86 0 

ACATGCACCG CTTTGTGATA GAATTAGAGC CTGTGAAGTC AACTATTCTT C AAAAT AAT G 7 92 0 

ACATTTCATT GGATCAAAAT GAAATTTTTG AAATTCTGAA AAATTTTCTC GATGGTTAAA 7 980 

ATAATTGTAG CGAGATGGCT ATATTGAATT CGTCTATATC TGG AAAC TAG AAAAAACTTC 804 0 

AATTTCAGGA GAAAATGAAG TCAATCTTCC CACAATCAAA CGTATAGTAT CAAGGTTTTT 8100 

CAAGACCTGA TATTATGCGT TTTTTGCTTT TCAAAACTTT TTGCCCAGTC TTCGTTTTTA 816 0 

TCCTCTAGTC ACTTGATTTG TTTCAGGTGG TTTTTTAGTA TAGTAGAATG AAACGAGAAC 8220 

AGGACAAATT GATCAGGACA GTCAAATCGA TTTCTAACAA TGTTTTAGAA GCAGAAGTGT 8280 

ACTATTCTAG TTTCAATCTA CTATAGTTAA ATCTGCGGTC AAGTCTACTG GTGAATCTAT 8340 

GATTGTAATA CTCTTCCAAA ATCTCATCAA CCACGTCAGT CTTGCCTTGC AGTCTGTATC 8400 

TTACTGACCA AGCTAGTGAT GGATTTAGAA TAGGTGATTT GGAGCGTCCT ATTAGCTAGG 84 60 



WO 98/18931 



PCT/US97/19588 



403 

AAATGCTGCT CATAGTCCTT TGCTGAGGCT AGGGTGTTTC AACATTCAAC ACTCAACTGG 852 0 

TTGATCTAGT TGATAGGAAG GGAGTTACTA TAAAATACTC AGGCTTCCAT CATATTTTTT 858 0 

GAAACGATTG TGTAATCAAA ATGTACCAAT ATTGT AG T AT TGGTACAGAA GATGTTGTGA 8 64 0 

ATGGATAAAT ATATCATAAC TGCTATCTCA AAAAGATTTC ATATGTCTGT GCATATATAA 87 00 

TAGACTTCCT GCAAAACTAG AATCCTAGTT CATGATTGAT AATACCAGCA ATCAAATTCA 87 6 0 

TTCGTAATCC AAAGCGTTTA CGATGATTTC GATAGGTTGT TGAAAACATT TTAAACGTTT 882 0 

CTACTTTGGC AAAGATGTTC TCAACCTTGC TTCTCTCCTT AG AT AG C G C A TGGTTATAGG 8880 

CTTTATCTTC AGCTGTTAGC GGCTTGAGTT TGCTGGATTT ACGTGGAGTT TGTGCTTGAG 8940 

GACATATCTT CATGAGCCCT TGATAACCAC TGTCAGCCAA GATTTTACCA GCTTGTCCGA 9000 

TATTTCTGCA ACTCATTTTG AACAACTTCA TATCATGACA AT AGT T C AC A GTG AT AT C C A 9 06 0 

AAGAAACAAT TCTCCCTTGA CTTGTGACAA TCGCTTGAGC C TT C AT AG C G TGAAATTTCT 912 0 

TT T T AC C AG A ATCATTCGCT AATTCTTTTT TTAGGGCGAT TGATTTTTAC TTCCGTCGCA 9180 

TCAATCATTA CCGTGTCCTC AGAACTAAGA GGAGTTCTTG AAATCGTAAC ACCACTTTGA 92 4 0 

ACAAGAGTTA CTTCAACCCA TTGGCTCCGA CGGATTAAGT TGCTTTCGTG AATACCAAAA 9 3 00 

TCAGCCGCAA TTTCTTCATA AGTGCGGTAT TCTAGGCTTA ATTTAGGTTT TCGTCCACCT 93 60 

TTTGCGTGTT TAAGTTGATA AGCTGTTTTT AATACAGCTA ACATCTCTTT AAAAGTCGTG 94 2 0 

CGCTGAACAC CAACAAGACG CTTAAATCGT GTATCAGTTA ATTGTTTACT TGCTTCATAA 94 8 0 

TTTCGCAGGG AGTCTATTGA CTCTTTGGTA GGTGTCAATG TTTTTTTCAT CTATCCCGAG 9540 

AATTATTTTC CCGCCATTTG TATTTGCAAA TGCTGAGTAG GTTTCCCAGA AAGACTCTGG 9 600 

AAGATTGTTT TTAGCTTTTT TGTATTCTAA ATCAACCCCT TCAAATTTTA AGT C CAT AT T 9 6 60 

TTTCCTTTAC ATCTGTTTTT TGTGGTTCTG GTATTTGTTC AAGTTGAGTG AT AAT AT AG C 972 0 

GAATTGAATT T CG AG AGT TT T T ACT C AG T T AATTTCTTTT TTAACCCACT TTAATTGCTT 97 80 

TTTTAACACG GGTTAAAAAA GAAATTAAAG TGGGTTAATT TTTCTTGA 9 828 
(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 69 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
<D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 42: 
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CCGCGAAAGA TATTTTTGAA CAAGAGTTTG GACGTGAGGT CCGTGGCTAT AATAAAGTAG 60 

AAGTTGACGA GTTTTTAGAC GATGTCATCA AGG ACT AT G A AACCTATGCT GCCTTGGTCA 120 

AGTCACTTCG TCAGGAAATT GCGGATTTGA AGGAAGAATT AACTCGTAAA CCGAAACCTT 180 

CACCAGTTCA AGCAGAACCC CTTGAAGCGG CAATTACAAG TTCTATGACG AATTTTGATA 240 

TTTTGAAACG CCTGAATAGA TTGGAAAAAG AAGTTTTTGG TAAACAAATT TTAGATAACT 300 

CAGATTTTTA AGTAGTTATT TGAGATGTGC AATTTTTGGA TAATCGCGTG AGGAGAATTG 3 60 

TTTCTCATGA GGAAAGTCCA TGCTAGCACA GGCTGTGATG CCTGTAGTGT TTGTGCTAGG 420 

CG AAAC CAT A AGCCTAGGGA CGAGAAATCG TTACGGCAGT TGAAATGGCT AAGTCCTTGG 4 80 

AT AGG CC AG A GTAGGCTTGA AAGTGCCACA GTGACGGAGT CTTTCTGGAA ACAGAGAGAG 540 

TGGAACGCGG TAAACCCCTC AAGCTAGCAA CCCAAATTTT GGTCGGGGCA TGGAGTACGC 600 

GGAAACGAAC GTAGTATTCT GACTGCTATC AGCTAGAGCT GTTAGTGGTA G AC AG AT GAT 6 60 

TATCGAAGGA AGTGGTCCTA GTCACTTCTG GAACAAAACA TGGCTTATAG AAAATTGCAT 72 0 

ATAGGTTGGG GCTGAGAAAT TTTCTCAACC TCATTTTTTA AAGTGGACAT ATAGAAAGGT 780 

CTTGCAAGAC TGTAACATGA AAAAAGAATT TAATTTAATT GCAACTGTGG CAGCAGGGCT 84 0 

TGAGGCTGTC GTTGGTCGTG AAGTGCGAGA GTTGGGCTAC GATTGTCAGG TTGAAAATGG 900 

ACGTGTTCGT TTTCAAGGAG ACGTGAGAGC T ATT AT CG AA ACCAACCTTT GGCTTCGGGC 9 60 

AGCAGATCGT ATCAAAATTA TCGTAGGAAC GTTCCCAGCT AAGACTTTTG AAGAGCTATT 102 0 

TCAGGGAGTT TTCGCTTTGG ATTGGGAAAA TTATTTACCA CTTGGAGCTC GGTTCCCGAT 1080 

TTCAAAAGCT AAATGTGTTA AGTCCAAACT TCACAATGAG CCCAGTGTTC AGGCTATTTC 114 0 

TAAGAAAGCT GTTGTCAAGA AATTGCAGAA AC AC T AT GC T CGCCCAGAAG GGGTTCCTCT 12 00 

GATGGAGAAT GGCCCAGAGT TTAAGATTGA GGTCTCTATT CTCAAAGATG TGGCAACTGT 12 6 0 

CATGATTGAT ACGACCGGGT CTAGCCTCTT TAAACGTGGT TATCGTACCG AAAAAGGTGG 13 2 0 

CGCTCCTATC AAGGAAAATA TGGCAGCAGC CATTTTACAA CTTTCTAACT GGTATCCAGA 13 80 

CAAGCCTTTG ATTGATCCGA CCTGTGGTTC GGGGACTTTC TGTATTGAGG CAGTTATGAT 1440 

TGCTAGAAAG ATGGCGCCAG GTCTTCGTCG CTCTTTTGCA TTTGAGGAAT GGAACTGGAT 1500 

CAGCGATCGC TTGATTCAAG AAGTGCGCAC AGAAGCGGCT AAAAAAGTAG ACCGTGAGCT 1560 

TGAGCTGGAT ATCATGGGCT GTGATATTGA TGCTCGCATG GTGGAAATTG CTAAGGCCAA 1620 

TGCTCAGGTA GCTGGTGTTG CAGGAGACAT TACTTTTAAG CAGATGCGCG TGCAGGATTT 16 8 0 

ACGTTCCGAT AAAATCAATG GAGTAATCAT TTCCAATCCG CCTTATGGTG AACGTTTGTC 1740 

AGATGATGCA GGGGTGACCA AGCTCTATGC TGAGATGGGG CAAGTATTTG CACCGCTGAA 1800 
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AACTTGGAGC AAATTTATCC TGACTAGTGA TGAAGCTTTT GAAAGCAAGT ATGGTAGCCA 18 60 

AGCAGATAAG AAGCGTAAGT TATACAACGG AACCTTGAAA GTGGATCTAT ATCAATATTT 192 0 

TGGTCAGCGT GTCAAACGGC AAGAGGTAAA ATAGAAAGGG AT ACT C ATG A GTAAAAAAAG 19 8 0 

ACGAAATCGT CATAAAAAAG AAGGTCAAGA ACCGCAATTT GATTTTGATG AAGCAAAAGA 2040 

GCTAACAGTT GGTCAAGCTA TTCGTAAAAA TGAAGAAGTG GAATCAGGAG TCTTGCCTGA 2100 

GGATTCCATT TTGGACAAGT ATGTTAAGCA ACACAGAGAT GAAATTGAGG CGGATAAGTT 2160 

TGCGACTCGT CAATACAAAA AAGAGGAGTT CGTTGAAACT CAGAGTCTGG ATGATTTAAT 2220 

TCAAGAGATG CGTGAGGCTG T AG AG AAGT C AGAAGCTTCT TCGGAGGAAG TTCCATCTTC 2280 

TGAAGACATC TT AC T AC C C T TGCCTCTGGA CGATGAGGAG CAAGGCTTGG ATCCTCTATT 2 340 

GCTAGATGAT GAAAATCCAA CAGAAATGAC TGAAGAAGTG GAAGAGGAGC AAAACCTTTC 24 00 

TCGTCTGGAT CAAGAGGACT CAGAAAAGAA AAGTAAAAAA GGCTTTATTT TGACCGTTTT 2 4 60 

GGCGCTTGTA TCAGTAATTA TTTGTGTCAG TGCTTATTAT GTCTACCGTC AAGTGGCTCG 2 52 0 

TTCGACTAAG GAAATTGAAA CTTCTCAATC AAC T AC AG C C AATCAATCGG ATGTGGATGA 2580 

TTTTAATACA CTTTATGACG CCTTTTACAC AGATAGCAAT AAAACGGCTT TGAAAAATAG 2 640 

CCAGTTTGAT AAACTGAGTC AACTCAAGAC TTTACTTGAT AAGCTGGAAG GTAGTCGTGA 27 00 

ACATACGCTT GCCAAATCTA AAT AT GAT AG TCTAGCAACG CAAATCAAGG CTATTCAAGA 27 60 

TGTCAATGCT CAATTTGAGA AACCAGCTAT TGTGGATGGT GTGTTGGATA CCAATGCCAA 2 82 0 

AGCCAAATCG GATGCTAAAT TTACGGATAT TAAAACTGGA AATACGGAGC TTGATAAAGT 2880 

GCTAGATAAG GCTATCAGTC TTGGTAAGAG CCAGCAAACA AGTACTTCTA GCTCAAGTTC 2 940 

AAGTCAAACT AGCAGCTCAA GTTCAAGTCA AG C AAGT T C A AATACGACTA GTGAGCCAAA 3 000 

ACCAAGTAGT TCAAATGAGA C TAG AAGT AG TCGCAGTGAA GTCAATATGG GTCTCTCGAG 3 060 

TGCAGGGGTT GCTGTTCAAA G AAGTGC C AG TCGTGTTGCC TATAATCAGT CTGCTATTGA 312 0 

TGATAGTAAT AACTCTGCCT GGGATTTTGC GGATGGTGTC TTGGAACAAA TTCTAGCGAC 318 0 

TTCACGTTCA CGTGGCTATA TCACTGGAGA C C AAT AT AT C CTTGAACGTG TCAATATCGT 3240 

TAACGGCAAT GGTTATTACA ACCTCTACAA GCCAGATGGA ACCTATCTCT TTACCCTTAA 3300 

CTGTAAGACA GGCTACTTTG TCGGAAATGG CGCTGGTCAT GCGGATGACT TAGATTACTA 3 3 60 

AGCAGTCGG 3 3 69 

(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9713 base pairs 
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(B) TYPE: nucleic acid 
<C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 
AAGTTTACAA TTTAAATGAA TTAACAATTT TCCCAACTAA AAGCACTCCA GTTACCGCAA 60 
CGTTTGTACT GAATGTACTA AATCGCATTC CATCAACTTC ATCTGTTTCG TCAACTTGAA 12 0 

C AG AT AC TAA TTGAAGATTT AATACTTCTG CTGCCATAGC TAGCTCCTCC TATTTAAATT 18 0 

TTTGGGATTA AGTACTTTAT CCACCCTCAT ATACTCTCTC CACCAGTAAA ATGCAAGCAA 240 
TGATACAAAA TAGATTTAAC TATTTTATAT AG CG AAAACT TACAAATTTT TAAGAAATAA 3 00 

TTTTTGCATT CTTAAAGATA AAATAGGAAC TTTTAGTAAT AAAT AT T AAA ATAAATAAAA 3 60 

TAATAGATAC TATAAAATTT GGAAGTATTA ACCCCAAAAG ATT CAT AT C A TCTATTAAAA 42 0 

TATCCTCTAA AGAGTAGTAT AT T AAAGC C A TAATTTTAAT GT T AAGT AAA AATGCAATTA 4 80 

ATGAAGTAAC AAATGTCAAA AATATAGCCT CACCAACTTT AATCTTAACC ATCTGGTAAT 540 
TAGAAGTTCC TAAAATTTCA AATTGCTGAA TCTCAATCCT TTCTTGATGC GATGACAAAA 600 
ATGCAATTGA AATAATATTT GCAAGTACTA TCAAAATTGG TGCTCCTACA TAGACAATAA 660 
ATGCTACTTT TAGCTCTAAA TCACTGTCAT CTTGAAATTG AGATAGTATA TTCTGAGAAA 720 
TCATTTGAAA ACTAGAAATT AGTAATATAG CTCCTGTAAT TGCAGCACTG ATAGATTTTA 7 80 

TATAAGACTT ACAATATAGT AAATTCCACT TCGAAACAAT GAACATAAAA TTATTTCTAA 84 0 

ATATAATTAT AGAAAGTAGT TTGATAAAAC ATGACTGTAT AAAAGGAGAT AAT TG AT AAA 9 00 

TAATCACAAT ATCTAAGATT AC AAT AT TG A ATATTATCTG GGCCTTCGCT AAAATTGTGC 9 60 

TATCTTGGAA AATTTGTTGC AAAGAAAGCA ACCAGATAAC ACTAAAACCA GCCAATAGCA 102 0 

GTATTCTTTT TACTATTGAA AGAACATGCC TT AT TTT AG A ACTCTTCCTA TTTCTAATCT 1080 

TCTTGAACGT ATAAAAGCAA C C ACT T AG AA AGGCTAAAAA TGAAATCAAC ACTACTGTAA 114 0 

TGATACATCC AAC AGC ACT C GTTTGAAATT GGATATCAGG T AAT AT ATT T TCCCCGAAAA 12 00 

AGTATTGTAA AAAATAATAA TAATTTGACG TAACAAATAT AGAGCATAGA TATGCAATAA 12 60 

AACTAATAAT CGAGGAAATG ATAAAAATCT GTCCCCCCAC AAGAAAT GAT AGTTGAAGGC 132 0 

GACTTGCTCC CAACACCTCC AGAAGTTCGT AAT CATC TCT AAAAATTTCA ACCAACATAT 13 80 

TTATTATGTT AGAGAGCACA AAGAATAATG TTACTCCTCC G AAT AC TAT C GGAAACATAA 1440 

AAATTGGTTT AGGATCTGGA AGTCCGACAA ATACTTGCGA ATTATTCTCA ACATTAATTA 1500 

CCCCATTAAC AGCCAATCCC ATAACTAAAC TCGAAACAAA AATTACTGGT GAAACGCCTA 15 60 
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ACCATTGTTT CTTATTATGT AAAAAT TGAT AGTAAACTAA TCTGAGCATC TCTATTCCTC 162 0 

CGTAGTTGAT TGTACCTCTA AGATTTTATA CAACTCTTCC CCGCTAGGTC TATGAAGTTC 1680 

TTTGAAAATT TTTCCATCTT TCAATATTAA TGCACGATCA GTTTTCGAGG CCAATTCTAT 174 0 

ATCGTGCGTT ACCATAATTA CACACTTACC CGCCCCTACT AACTGTCTCA ATAATTCAAA 1800 

AATTACTTCA CGAGAAACGC TGTCTAAAGC CCCAGTTGGC TCATCAGCAA AT ATT AT AT C 18 60 

ACT ATC AG C A ATAACCGCTC TAGCTATAGC AACCTTCTGT TGTTCTCCAC CAGACAGAGT 192 0 

TCCAACAAAA TCGTTTAAGC CAGCATTAAA CTTCATTCTT TTGAGTAAGT TTTCTACATT 1980 

TTTAATAGTT AATTTTTTTT GTGATAATCG CAAAGGAAGT GCTATATTTT CTATTACCGG 2040 

CAGGGAAGGT ATTAAATTGT ATGCTTGAAA TATAAAAGAT ACTTCGTTAC GTCTTATACT 2100 

TGACAATTTT GCATTTCTGA TTTTATAGGG GTTGATTCCA TTTAAAATTA CTTCCCCACT 2160 

TGTTGGTTCA AG C AAACTAG AAAT AC AT T T TAATAAAGTT GACTTTCCAG AACCACTAAT 22 2 0 

TCCTAGAATA CTTATAAATT CTCCTCTCGA AGCAGAAAGA GAAACATTTT TCAGCACTTG 2280 

CAACGTTTTA TTATTTCCTA GTAAAAATTG ATGATACAGC CCTTTCACTT T T AAT AT AT A 2340 

ATCTTTATCC ATATTCTTGC CTCCAATCAC TTAATTTTGA AAAGTGTTCC ATTTTCCAAT 24 00 

TTATATATAT CAGTGTATCT CTTGTCATTT AAGTCATAAT GATGTGAAAC TT C AAT AAAT 2460 

GAAATACCTA AATTGAACAG AATATCATGT ATGGAATTTG AAT TAT C AT T ATCTAAATTA 2 52 0 

GCTGATATTT CGTCAAATAA GT AC ACT T T A TTATTTCTAA TCAGAGCTCT AGCTAAAGCT 2580 

ATTTTTTGTT TTTGACCTCC AGACAAATTA CTACCATTTT C AC C AC AT TG ATAATTTAGT 2 64 0 

AT AT C T AT CT TTTCTAATTC TTCATATAGA TTTACCTTTT TTAACACCTC AATTATCTGA 2 700 

T C AT CTG AAA AAT AT T C ATT TTGAAATAAA GTTACGTTCT CACGAATAGT AGTGTCAAAA 27 60 

ATATATGGTG TCTGATCAAC TGTTGGTATT GAATCTGAAC TCTTTTTCCC ATGTGATAAC 2 820 

AAAT T T AC AT AACCTTTTTG TGGCTTTAAA GAACCATTAA TTAAATTTAA AATCGTTGTT 2880 

TTCCCACTAC CAGAAGTTCC TGTTAATAAT ACC C T AAAT G GTGACTTAAA TGAGAAGTCA 2 940 

ATACTTAATT TATTTTCTGG TGTAATAGAA TATACAACAT CTTTCATGTG TATCTCATCT 3 000 

ATTGATGAAG TATACAGTCC GTTATTATCA TGTTCAGCGT CT AT AAAAT T CTTCTCTCCA 3060 

CTTAAGTATT TTAAAAACGG TTTCCTTAAA TCTTTGGTTG TATTTATCTT AT TT AATG AA 312 0 

TAGGCAATTG ATTGTATCGG CCCTAAAACT TTATCGTTTG C TAAG AAAAT ACCTATCAGT 318 0 

TCACTAAAAG AAAGGCTTTT ATG AT AAAT T ACAAAATAAC AT C CT AC AAC CAAGGGAACT 324 0 

AGAAAGCAAA AAC CTG AAAT TAGTACTGCA ACCAATTTTG AAAGAACCTC TGATCGTTTC 33 00 
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AAATTAAAAG TAGAATCTTC TAGTTTATCC AACTTTTTAT CCGACAAACT AATTATTTCT 3 3 60 

TTAGTAACAG AATAAGATTT TAATGTCTTA AAACCATTAA AAATTTCTTT TATTATGTGA 3 42 0 

GTATACTCTG CATTGCTGTT AGAGTACTCA TTAGCTGAAT TAGACAACAT CTTCTTCATA 3 4 80 

AAGACAGGTA CTATAATCGG CAATGCTGAT AATACAATAA ATATTATTGA nACTAGGAAG 3 54 0 

TT T AAAT AAA G C AT AAAACT TAGAGAGACG ATGAACAACA ATATTGAAGA AATTATTTCA 3 600 

AAAATTTGTC TAAAATAGTT TTCTTCGATT AATCT C AAAT CAT T T G AC AA AACTGAAATA 3 6 60 

ATAGATGAGT AATCTTTAAC CATTTCAGAA GAAAGATACT GTTCTCTAAA ATATCCTTGT 372 0 

TTAATTTTTA CAT T TAT AT C TTTAGTTATT GATGCTTCCG TTACTTCTAA ATAGTAATTT 37 80 

GATATATAGA TTGCTGACCA ACCCAGAATA CTTATAGCAC CAAATCTTAG AACGTCAGAA 384 0 

AATGAGGAAG TCTGATTTAA ACTACCTGCA TATACAATAA TTCCTGAGAG CAAGACACCA 3900 

TTAAACGAAG ATAGAAATAT TAAAATCCCC ATTAATATAA GTTTAGTCTT T TTT AT AAAT 3 9 60 

TTTAAATAAT TCATAAGTTA TTCCTTCCCA CTTCTTCAAA GAAATAATTT AAAGTATCAA 4 02 0 

T CAT T AAG AG AACATCTGAT GGAGTAAAAC CTCCATGACC AGCTGCTTTG TTT AAAT ACA 4080 

ACAAACTTTT AACTCCAATA GAATTTAATT TCTTTGACCA CTCTATCACT TCGTTATTAT 414 0 

TAATATATGG GTCTTTCTCA C C C AAAAT AT TAACTATAAC AGTATTTGAG TCTCGTGCCT 42 00 

TTTCAATATT TTGCATAGGC GAATATGACT TTATATAAGC CTTTACTTCA GGGTCTCTAA 42 60 

TATCTCCCCA CTCTGCTATT TCGGTCTTAG AAAGAGG AT C ATTTGGATTC TGAAGTGTAT 4320 

CATAAGGATT TATAAATGGC G AAAAT AAGA GAATGCTTTG CAATAAATTT TTTTCCTCGT 43 8 0 

TCAACACCGC ACCAGCAATT ATTCCACCTG CACTAGAAGT TATTAAACCT AATCGCTTAC 4440 

TGTCAATTAC ATCATTTTCC CTTAAATAAT TTACTCCCTC AAT AAAAT CT CTGATAGAAT 4500 

TCCATTTGTT TAACGCCTTT CCTGAGCGAT ACCATTCACC ACCCAAATAG CCTCCACCTC 4 5 60 

TTACATGAAC TATAGCATAA ATAAAACCTG CATCTATTAT AGATAACATA ATTTCATCTA 4 62 0 

AATCAGAATT ATCATTCTTA CCATAAGCCC CATAGACACT TAGAATACAT TTTTTTCTTC 4 680 

TTGGGAGCTC ATCCGTATCT TCACTTTTCC AAAAT AAAGA AATCGGTATG CTTACATCAT 4740 

AACTGTCTTT TTTAGTCCAA ATCACCTTAG AAAAATATTT AGTATTATTC GATTTTATGA 4800 

TGGGTCTTTC AAATTCAGTT TTTAATGTAT TTTCTATTAA ATCAAAACTA AGTATTTTTT 4860 

CGTAAAAAGT TCTCCTCTCT AAAAACAGAA GAACACGATC AGAAAATGAA TTT T CAT AAA 492 0 

GTGTTGTCTT TT CATC AAAT GTTATCTTAT T AAC ACT C AA CTCCCTCAAA CTATTATTTT 4980 

TAAATGTAGC AAGATAAAAG ACGGAATTCG CTGCGTTTGA ACAGTCTAAA AGGATATAAC 5040 

GTCCTATACA GTGAACTCTT CTAGCCCTAT CTTGATATGG TATAGTAATA GAAACTCTGT 5100 
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CTCCCGAAGA AGTTTCCCTT AGAATTAGTT GATCTTTCTT TTCTTCAGTT GAAGAGAGCC 516 0 

CAAGAAAGTA CTGTGCTTTT TCTGTACTAA ATAGAGCGAT ATCTCTAGGT GTTGGGGCTA 52 2 0 

CCGTTTCTGT GTAAGAGTGT CTAACAAAAC CCGTCCGGTC GAAACTGTAT AG AAAAATC C 5280 

TGCCTTTCTG AAAGTCT AC T GACTTTACAA AACAATTATT GCTATCAATG TGGACTATTT 534 0 

TTAATCGAAA AGAGCATTCG TTTTCTTCAA ACAGTTCCTC TTCTGTAAAG CTATCAAAAG 5400 

ATTTATAGAA TAACTTACTT GGCCTCCCGT ACTCTTTGGA GCGAGTATAC ATAACACCGA 5460 

ATTTACCCAA AT AG AAC G AA CTTTCTACTG AAATATCTTC AATGATAAAT AACTCTTCCA 5520 

TAGTATATTT TTTTATTCCA ATTAAATTAG TCGTACGCAG T G AGG AT AC A ACCAAAACTA 5580 

TATAACTCTC AT C AG ATGAA AT CCT AAC AT CCTGTAAGAT ACTATCATCT GGCAAAGTAT 564 0 

ATTTTTCCAC ATCAAAGACA ATTTTAAGTG AATTTGAATT GTCTAAACTG GAAGAACTAA 57 0 0 

CCTTAGGAAT CCAGTCATTA TCTTCGACAT ACCATTCCTT TATTACACCA GTATTGGGTA 57 60 

TACTCCAATT ATCAAATTGG TACCAATATC GCCCTCTCCT AAAT AT C AAA GAATTCCATT 5 82 0 

TTTTTAATTC CTGAAATGAT GAAGAGATAG ACCTCTTATA GTGTGTTTTT TCCTGTATTG 5880 

TATTTAAAAA TATTTCATTA CTCTGATTCA CAAGTATGAC CCCTTAATAA TGGTATCTAA 5 94 0 

ATATTATATT TGAGGAAGAA TCGTCAATTT ATTATCCATT ATTGATACCA ATCCAATTGC 6000 

AACACCCGCA AATCCCGAAG CAATATCTGT TGTTATCTTT AAAC C ATT AT CTCCCGCAAT 6060 

AAC AAAT CCT TCTTCAATTA CACACAAATA TCTATAAAGT TGTTCAATTA ATTTCTTTTG 6120 

TCCTGAAAAG TTATCATCGA TATCACTATA TATATTATTA GCAACTTCAA G AC C AC AAAA 6180 

TCCGTTAAAT AAACCTGGTA ATACACAAAA AACTACATCA GTTGCCCTCT CTAAAGAAGT 6240 

TAAATATTTT AAGTATTTGC TTGACAAGAT TTCTTTATTT CTATTAATAA GTAAAAGCAG 63 00 

GCCAGCACTT CCAGTTGCTA GATATGGTAG TAATCTATGA CCTTGGCTGT ACTGCAATGA 63 60 

ATTATTACTA TCTACTTTAT AAGCAACTAA TTCTTTATCT ACAGCCAATT CTAGACCATT 642 0 

TT TAT AG AT A CTTTCACCAG T T AATTT AT A AGCTTCACCG AAGAGCCAAG CTACCCCTGC 64 80 

GTGACCATAT AGTAATCCAC CAAAATTCTC ATAAGGATCG TTACTCTGAA CAT C AC TAG C 654 0 

GCCAACTTTA CAAAAAGTTT CTGGATTTTC TAT AT AATTT AAAGTATATT CTCTAAGCCT 6 600 

AATTAGTATT TCTTCTCCTA GTTTATTATC AATTCCCCCT TTACTAAGAA AATACAGTCC 6660 

AACCAGTAAA ATTCCAGCCT GCCCACTATA TAAATTTTTA TTTTGTGAAT TCTCAAATAT 6720 

CTCTATAAAA TGAGTTGTAA AAAGTT C AAC TGCCCGATCT ATCTCCCCAA ATTCATAAAT 6780 

GAGCCAGATT GT AC C AATTT TACCATCAAA AAGACCAGAA AGGGACGATT TCTTAAAATT 6840 
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ATTTACTGCC TCATTAATAA CCTGTGTTCG AATCTCATAA TAG T CATC AA ACTTGAAATT 6900 

TTTTACTTTC TTAGCTAGTT GTTGATAACT CCAAAGGATA GCTAAATCTG AAAACGCAAT 6960 

TCCTTGATTA AAAT T C AG AC CATAATAATG AACTGGGAAG AATCTTGATT GAAATTCTTT 7 02 0 

ACGCCACTGT CCATAAGTTA GCGTAAACCC TCTCAATAAT TTTATAATAA AATCTTGTAT 7 080 

ATCTTGCTCA CTCTCGATAG TTCTAATCTC ATGCATGGGT TTTAAAACTT TTTTCCTGGA 7140 

AATATTCTCA ATCTGTGGAC ATTTAGAATC TAGATATGAC AATAAACTTT CTACATAATC 72 00 

TATATGTTCT CTTGTATAAC CCAAAGACTC AAATAGTTTT TTTCCTTCTA TCCTGGTTTG 72 60 

ACTTACATAG TTGTATGTCA AATCCGATGT AGT T ACT AGT GGCATGTATA AATAATGAGC 7320 

TATTTGTCTA ATACCATACC AATCTATCTC ACTGGGAAGT GTTTCTCGCC ATGCTCTAAA 73 8 0 

ACCAGGGGCT GCAACTTTAT GTACAACTTT TTCATCATTT GAAAAGACAG CCTGTTCCCA 7440 

GT C TAT TAT A CTAATCTCAT CTTCATCCTT AACCAAGATA TTTCCTAAAT GTAAATCTTG 7500 

ATGATATACA TTTTCAGAAT GAAACTTATT CGTTAAATCG ATGAGTTTTT CTACTATCTT 75 60 

TGAAACTCTC AATAGATAAT CTTTGGTCTT ATCAACAACT T C AT AT AAAG GAAAATTATT 7 62 0 

GGTAACCCAT CTATTTAGTG GAACGCCCTT CATATGTTCA ATTCCTAAGA AGGTGTGCTC 7 680 

CCAGATCTTA CCGTGCCAGT ATATTTTAGG CGTCTCACTC CATTCATTTA GAATTTTTAG 774 0 

TGCTTTGCAC TCCGAAGCTA ATTTCTCTGA AGAATAAGTA C C AT C AAAT C CTAGACCTGT 7 800 

ATACGGTCTA GCCTCTTTTA AAATTATTTT TTTCCCATCT TCTTTTAGCC TAGCATTATA 7860 

TATCCCACCA CTGTTTGAAA ATCTAATTGC ATTATCTATA ATAAAGGGAA AGTCTCCCTG 7 92 0 

TTTTTTATCT TTCTTGTCAA GCCATTTATT CAAAAAGTCA GGGGGCACTA TACCTTTTGG 7 980 

AATTTTAAAT ACTGGTAAAC GTTCATCTTT AACAACTTCA TCGCCAACAA TTAATTCATC 8040 

AATAGCAACC TTCTTTTCAT CATCCCTTGA CGGCCTAAAC ACACCATACC TCAGATATAT 8100 

TGGTGCTTCA TCCCAACGTT TATCGCTTAA AATATATGGC CC ATT AT ATT GCTTTAAGGC 8160 

ACTTTCTAAC CTTTGCAAAA CCGACTCTAA TTCATTTTGA TTTGGATAAC ATGTAATAAA 822 0 

TTTACCAGAA AATCCTCGAC TAACCAATTT CCCGTTTCGC ATGATAAATT TGTCTTCTGT 82 80 

ACTAAGATGT TTAAATGGAA TTCGCATTTC ATGGCAAATT TTTGCTACAT CTTGTAACAA 8340 

TTCATGTGAA CTGTTATACT CTGAACTAAT GTGTATTTTC CACCCTTGTC TTTCAACAAA 8400 

TTTTCCAATA GGGTATTGAT AAACCCACTC ATCATTATTC ATTACTTCGT GCCAATTAAA 8460 

AGGCAGACTT ACTTGGTACT TTATGCTAGT ATCTGTACTA TAATCATTAT TAGTGAAAAA 8520 

GAAAGGATGC TCCAAATTGA AATTATAATC CATAACAAAA TCTCCAAGAA ATTTTATCAA 8580 

ACTTAATATA TCTATAGCTA G AC AG AC TT A TTTAAATAAA AAGGGAGAAT CCTTTGGATT 8640 
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CTCCCCATAT AAGCACTAAC ATTCCAACGT GCACATATTG GAACGACATC CATAACTCCA 87 00 

GAGAATCTCT AAAGTT T AC A AT T T AAATG A ATTAACAATT TTCCCAACTA AAAGCACTCC 87 60 

AGTTACCGCA AC G AT TTGT A CTGAATGTAC TAAATCGCAT TCCATCAACT TCATCTGTTT 8 820 

CGTCAACTTG AACAGATACT AATTGAAGAT TTAATACTTC TTCTGCCATA GCTAGCTCCT 8 880 

C CT AT TT AAA TTTTTGGGAT TAAGTACTTT ATCCACCCTC ATTATACTCT CTCCACCAGT 8940 

AAAATGCAAG CAATTATACA ATGTTGT C AC ATAGAAAATA ATGTTTCCGT AACTTTTCAA 9000 

AGTAACTTCC ATCTCTCTCC CAAAACTGGA AGTTAGTTTT AGAAGTTACC T AAAAAT C AG 9060 

GTCACCTATT TTAAAAAAGC AGCAAACTAT AAACTAGTAG GTTCCACACC AAATGTAGTC 9120 

CCATACTGCC CCATAAGTCA GATTTATAGC GCACCATACC TAAAAACATC CCAAGTGAAA 9180 

CATACAAACA CCAAGCTAGA ATGGTTCCTG TATGATGTGC TAAGGCAAAT AAAACACTTG 9240 

TCAAAGCAAC TCTGATATCT AATTTTCTGA CCAAATTCCA TAAAATTTCT C GAT AC AG AA 93 00 

ATTCTTCAAC CATACTCGCA TTGATTAAGA ACAATAAAAA TGAAAACCAA GGAATTTGAT 93 60 

GTTGAAGGCC AATTAAGTTT GCTTGATTCG TGCTTCCTTG AG CAT G AAT C AGACTAAAAC 942 0 

ATAGACTTAT AATCAGTAGG CTAACAAATT CAACACCAAG CCATTTCATC CT AG AT T T C A 9480 

TATTGACCTT ATGCGCTTGT TTGCGTTGGC CATACATCCA TAAAAAAGAA ATGAGTGACG 9540 

AACCATAGAG AATCTGTAGT ATAGTTmACT CACCGATACA AAGAAATTTC AATAAG TATA 9600 

GAG r T AC C AA TAsGACATTT ACTTGTTGGA ATATATAAAC T GG AAT T ATT CTTTTCATAG 9 6 60 

TTACCTCCGA AATAAATCTT CATAATCTAA AT C T AAT AC C TGCACAATCC TTT 9713 



(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8657 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 44: 

AAAGAAATTG TCAGAGAGTG GCTAGATGAA GTAGCAGAGC GGGCTAAGGA CTATCCAGAG 6 0 

TGGGTGGATG TTTTCGAGCG T TGCT AC AC C G AT AC CTTGG ACAATACGGT TGAAATCTTA 12 0 

GAAGATGGTT CAACTTTTGT CTTGACTGGG GATATTCCTG CCATGTGGCT T CG AG AT T C G 180 

ACAGCCCAAC TCAGACCCTA CCTTCATGTA GCTAAAAGAG ATGCCCTCCT GCGTCAGACC 240 

ATTGCAGGTT TGGTCAAACG TCAGATGACC TTGGT ACT C A AGGATCCCTA TGCTAACTCC 3 00 
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TTCAACATTG AGGAGAACTG GAAAGGGCAC CACGAGACTG ACCACACAGA CCTTAACGGC 3 60 

TGGATCTGGG AGCGCAAGTA TGAGGTGGAT TCGCTTTGCT ATCCTTTGCA GTTGGCTTAT 42 0 

CTCCTCTGGA AAGAGACTGG CGAGACTAGT CAGTTTGATG AGATTTTTGT CGCAGCGACT 480 

AAGGAAATTC TCCATCTGTG GACGGTGGAA CAAGACCACA AGAACTCTCC TTATCGTTTT 54 0 

GT CCG AG AT A CGGACCGTAA GGAAGACACC TTGGTAAATG ATGGCTTTGG ACCTGACTTT 600 

GCAGTGACAG GTATGACTTG GTCAGCTTTT CGTCCGAGTG ATGACTGTTG CCAGTATAGT 6 60 

TACTTGATTC CGTCAAATAT GTTTGCTGTA GTAGTCTTGG GTTATGTGCA AGAAATCTTC 72 0 

GCAGCATTAA ACCTAGCTGA TAGCCAGAGT GTTATTGCTG ATGCCAAGCG TCTTCAGGAT 7 80 

GAAATC C AAG AAGGAATCAA AAACTACGCT TACACCACCA ACAGCAAGGG CGAAAAGATT 840 

TACGCTTTTG AAGTGGATGG CCTAGGAAAT GCCAGCATCA TGGATGATCC AAATGTACCA 900 

AGTCTACTAG CTGCGCCCTA TCTGGGCTAC TGTTCGGTCG ATGATGAAGT GTATCAAGCT 9 60 

ACTCGTCGTA CCATTTTGAG CTCTGAAAAT CCATACTTCT ACCAAGGAGA AT AC GC AAG C 102 0 

GGTCTCGGCA GTTCTCATAC CTTCTATCGC TATATCTGGC CAATCGCCCT TTCTATCCAA 1080 

GGCTTGACAA CAAGAGATAA GG C AG AG AAA AAATTCTTGC TGGATCAGCT GGTTGCCTGC 1140 

GATGGTGGTA CAGGTGTCAT GCACGAAAGC TTTCATGTAG AT GAT C CG AC CCTCTACTCT 12 00 

CGTGAATGGT TCTCCTGGGC TAACATGATG TTCTGTGAGT TGGTCTTGGA TTACTTGGAT 12 60 

ATTCGCTAAG GGGCTCGCTT TAGCTCAACC GATTCTTATC AGAATCACAA GTTTACATTT 13 2 0 

AAAACGTTAA AATTTAAATT TAGAATGAGG TTTTACTTCA TGGAAAATGT TGTTGTACAT 13 8 0 

ATTATCTCAC ATAGTCACTG GGATCGTGAG TGGTACTTGC CTTTTGAAAG CCATCGTATG 1440 

CAGTTGGTGG AATTGTTTGA CAATCTCTTT GATCTCTTTG AAAATGACCC TGAGTTCAAG 1500 

AGTTTCCACT TGGATGGACA AACTATTGTC CTTGATGACT ACTTACAAAT TCGCCCTGAA 1560 

AATCGCGACA AGGTCCAACG CTACATTGAC GAGGGCAAAC TTAAAATTGG TCCCTTTTAC 162 0 

ATCTTGCAGG ATGACTACTT GATCTCCAGT GAAGCCAATG TCCGCAATAC CTTGATTGGT 1680 

CAACAAGAAG CTGCCAAATG GGGTAAATCA AC CC AG AT T G GCTACTTTCC AGATACCTTT 174 0 

GGAAATATGG G AC AAG CG C C TCAAATTCTT CAAAAATCAG GCATTCACGT GGCGGCCTTT 1800 

GGTCGTGGTG TGAAGCCGAT TGGATTTGAC AACCAAGTCC TTGAAGATGA GCAGTTTACG 1860 

TCTCAGTTTT CAGAAATGTA CTGGCAGGGT GTGGATGGTA GTCGTGTTTT AGGTATTCTC 192 0 

TTTGCCAACT GGTACAGTAA CGGGAATGAA ATTCCAGTTG ACAAAGATGA GGCCTTGACC 1980 

TTCTGGAAAC AAAAATTGTC AG ATGTG CGT GCCTACGCTT CGACCAACCA ATGGTTGATG 2040 

ATGAACGGCT GTGACCACCA GCCTGTACAG AAAAATCTGA GCGAAGCCAT TCGTGTGGCA 2100 
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AATGAACTCT TCCCGGATGT AATCTTTGTT CATAGTTCTT TTGATGAATA TGTTCAAGCT 2160 

GTAGAAGGTG CGCTTCCTGA AC ACT TAT C A AC TGTT AC AG GCGAGTTGAC CAGTCAGGAA 222 0 

ACAGATGGCT GGTACACACT TGCCAACACT TCTTCATCCC GCATTTACCT AAAACAAGCC 22 80 

TTCCAAGAAA ATAGCAACCT CCTAGAGCAA GTGGTAGAAC CCTTGACTAT TATCACTGGT 2 340 

GGACACAACC ACAAGGACCA GTTGACCTAT GCTTGGAAAA CACTTTTGCA GAATGCGCCA 2400 

CATGATAGTA TCTGTGGCTG TAGCGTGGAC GAAGTTCACC GCGAGATGGA AACGCGTTTT 2 4 60 

GCCAAGGTCA ACCAAGTAGG AAACTTTGTT AAAAGTAACT TGCTCAACGA GTGGAAGGGT 2 52 0 

AAAATTGCTA CGGATAAGGC T C AAAGTG AC TATCTCTTTA CTGTCATTAA CACAGGCTTG 2 580 

CATGATAAGG TCGATACTGT CAGCACAGTG ATTGATGTGG CGACTTGTGA TTTCAAGGAA 2 64 0 

TTGCACCCAA CAGAAGGCTA CAAAAAGATG GCTGCTCTTA TCTTGCCAAG TTACCGTGTG 2 700 

GAGGACTTGG ATGGTCGTCC TGTAGAGGCT ACAATCGAAG ACCTCGGAGC TAATTTTGAG 2 7 60 

TATAATTTAC CAAAAGACAA GTTCCGCCAA GCTCGTATTG CTCGTCAAGT GCGCGTGACC 2 82 0 

ATTCCAGTTC ACCTAGCGCC GCTTTCTTGG ACAACCTTCC AATTGCTGGA AGGAAAACAA 2 880 

GAACACCGTG AGGGTATTTA CCAAAACGGA GTGATTGATA CACCATTCGT AACGGTGAGT 2 94 0 

GTGGATGACA ACATCACAGT CTATGACAAG ACAACTCACG AAGCCTATGA AG ACT T TAT C 3 000 

CGCTTTGAAG ACCGTGGGGA CAT CGGAAAC GAGTATATCT ATTTCCAACC AAAAGGAACA 3 0 60 

GAGCCAATCT TTGCAGAGCT TAAGGGCCAC GAGGTCTTGG AAAACACAGC TTGCTATGCT 312 0 

AAAATCTTGC TCAAACATGA ATTGACCGTG CCTGTCAGTG CGGATGAAAA GCTAGAAGAA 3180 

GAGCAACAAG GTATCATCGA GTTTATGAAG CGTGAGGCTG GACGGTCAGA AGAATTGACA 3 2 40 

AACATTCCTC TGGAAACTGA GTTGACTGTC TTCGTTGACA ATCCACAAAT CCGCTTCAAG 3 3 00 

ACTCGCTTTA CTAACACTGC CAAGGATCAC CGTATCCGTC TCTTGGTCAA GACTCATAAC 3 3 60 

ACGCGTCCAA GCAATGATTC TGAAAGTATC TATGAGGTGG TGACACGACC AAACAAACCA 342 0 

GCTGCTTCAT GGGAAAACCC TGAAAATCCT CAACACCAAC AAGCTTTTGT CAGTCTGTAT 3 4 80 

GACGATGAAA AAGGGGTGAC TGTATCCAAC AAGGGATTGA ATGAATACGA AATCCTTGGG 3 54 0 

GATAACACCA TTGCCGTGAC CATTTTGCGT G C ATC AGGTG AGCTAGGTGA CTGGGGCTAC 3 600 

TTCCCAACGC CAGAAGCACA ATGCTTGCGG GAGTTTGAAG TCGAGTTTGC ACTTGAATGC 3 660 

CACCAAGCCC AAGAACGCTT CTCAGCCTAT CGTCGTGCCA AAGCCTTGCA GACACCGTTT 372 0 

ACCAGCCTTC AGCTTGCTAG ACAGGAAGGA AGCGTGGTTG CG AC TGGT AG CCTCTTGAGC 3780 

CATTCTGTTC TC AG CAT AC C GCAAGTTTGT CCAACAGCCT TTAAGGTAGC TGAAAATGAA 3 84 0 
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GAAGGCTATG TGCTTCGTTA CTACAATATG TGTAGTGAAA ATGTACGTGT GCCAGAAAGT 3 90 0 

CAACATCTCT TCCTTGACCT ACTTGAACGA CCATACCCAG TTCATTCAGG ACTATTGGCT 3 9 60 

C C ACAAG AG A TTCGTACAGA AT TC AT C AAA AAAGAAGAAA TTTAATTTCA AAAAGTAAAC 402 0 

ATCAAAAGAA AGGAGGGGCG AAAAAGTAAG AACTAACTGC TGATTCGCCC CTTTTATGGT 40 80 

AAAAACAATG ACCATTGCAA CGATTGATAT CGGAGGGACT GGGATTAAGT TTGCCAGTCT 4140 

GACTCCTGAT GGGAAAATAC TGGATAAGAC AAGTATTTCA ACGCCTGAAA ACTTGGAGGA 42 00 

TTTACTAGCG TGGCTAGATC AACGCTTGTC AGAACAGGAT TACAGTGGGA TTGCTATGAG 42 60 

CGTTCCAGGT GCAGTCAATC AAGAGACAGG TGTGATTGAT GGCTTCAGTG CGGTGCCCTA 4 32 0 

CATCCATGGC TTTTCTTGGT ATGAGGCGCT TAGCTCTTAT CAGCTACCTG T C C ATT TAG A 43 8 0 

AAATGATGCC AACTGCGTTG GACTCAGTGA ACTACTAGCT CATCCAGAGC TTGAAAATGC 444 0 

AGCCTGTGTC GTGATTGGGA CAGGGATTGG CGGAGCCATG ATTATCAATG GT AG ACT TC A 4500 

TCGAGGTCGC CACGGTCTGG GTGGAGAATT TGGCTACATG ACAACCCTTG CCCCTGCTGA 45 60 

AAAACTTAAT AACTGGTCGC AACTAGCATC AACTGGGAAT ATGGTACGAT ACGTGATTGA 462 0 

AAAATCTGGT CATACTGATT GGGACGGTCG CAAGATTTAC CAAGAGGCCG CAGCTGGTAA 4680 

TATCCTTTGT CAAGAAGCCA TTGAGCGCAT GAACCGCAAT CTGGCGCAAG GCTTGCTCAA 4740 

TATCCAGTAT CTGATCGATC CAGGTGTCAT CAGTCTGGGT GGCTCTATCA GTCAAAATCC 48 00 

AGATTTTATC CAAGGTGTCA AGAAGGCTGT TGAAGACTTT GTCGATGCCT ACGAAGAATA 48 6 0 

CACGGTCGCA CCAGTTATCC AGGCCTGCAC CTATCACGCA GATGCCAATC TCTACGGTGC 4 92 0 

TCTTGTCAAC TGGCTACAGG AGGAAAAGCA ATGGTAAGAT TTACAGGACT TAGTCTCAAA 4980 

CAAACGCAAG CTATTGAGGT TTTAAAAGGT CACATTTCTC TACCAGATGT GGAAGTGGCT 504 0 

GTCACTCAGT CTGACCAAGC ATCTATCTCT AT CGAGGGTG AGGAAGGTCA CTATCAATTG 5100 

ACCTACCGCA AACCTCACCA ACTTTATCGT GCCTTGTCCT TGTTGGTAAC AGTTCTAGCA 5160 

GAAGCTGATA AAGTAGAGAT TGAGGAACAA GCAGCTTACG AAGATTTGGC TTACATGGTT 5220 

GACTGTTCTC GAAATGCGGT GCTGAATGTG GCTTCTGCCA AG C AG AT GAT T GAG AT AT TG 52 8 0 

GCTCTCATGG GCTACTCAAC CTTTGAGCTT TACATGGAAG ACACTTACCA GATTGAAGGG 5340 

CAGCCTTACT TTGGCTATTT CCGTGGAGCT TATTCAGCAG AGGAGTTGCA GGAAATCGAA 5400 

GCCTATGCCC AACAGTTTGA CGTGACCTTT GTACCATGCA TCCAGACCTT GGCCCACTTG 5460 

TCGGCCTTTG TCAAATGGGG TGTCAAGGAA GTGCAGGAGC TCCGTGATGT AGAGGACATT 5 52 0 

CTTCTCATTG GCGAAGAAAA GGTTTATGAC TTGATTGATG GCATGTTTGC CACGTTGTCT 558 0 

AAACTGAAGA CTCGCAAGGT CAATATCGGG ATGGACGAAG CCCACTTGGT TGGTTTGGGA 5 640 
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CGCTACCTGA TTCTGAACGG TGTTGTGGAT CGTAGTCTCC TCATGTGCCA ACACTTGGAG 57 0 0 

CGCGTGCTGG ATATTGCTGA CAAATATGGT TTCCACTGCC AGATGTGGAG TGATATGTTC 57 60 

TTCAAACTCA TGTCAGCGGA TGGCCAGTAC GACCGTGATG TGGAAATTCC AGAGGAAACT 582 0 

CGTGTCTACC TAGACCGTCT CAAAGACCGT GTGACTCTGG TTTACTGGGA T T ATT AT C AG 5880 

GATAGCGAGG AAAAATACAA CCGTAATTTC CGCAATCATC ACAAGATTAG CCATGACCTT 594 0 

GCATTTGCAG GGGGAGCTTG GAAGTGGATT GGCTTTACAC CTCACAACCA TTTTAGCCGT 6000 

CTAGTGGCTA TCGAGGCTAA TAAAGCCTGC CGTGCCAATC AGATTAAAGA AGTCATCGTA 6060 

ACGGGTTGGG GAGACAATGG TGGTGAAACT GCCCAGTTCT CTATCCTACC AAGCTTGCAA 612 0 

ATCTGGGCAG AACTCAGCTA TCGCAATGAC CTAGATGGTT TGTCTGCGCA CTTCAAGACC 6180 

AATACTGGTC TAACGGTTGA GG AT T TT AT G C AG AT T G AC C TTGCCAACCT CTTACCAGAC 624 0 

CTACCAGGCA ATCTCAGCGG TATCAATCCC AACCGCTATG TTTTTTATCA GGATATTCTT 63 00 

TGTCCGATTC TTGATCAACA CAT G AC AC CT GAACAGGACA AACCGCACTT CGCTCAGGCT 63 60 

GCTGAGACGC TTGCTAACAT TAAAGAAAAA GCTGGAAACT ATGCCTATCT CTTTGAAACT 642 0 

CAGGCCCAGT TGAATGCTAT TTTAAGTAGC AAAGTAGATG TGGGACGACG CATTCGTCAG 6480 

GCCTACCAAG CGGATGATAA AGAAAGTTTA CAACAAATCG CCAGACAAGA ATTACCAGAA 654 0 

CTTAGAAGCC AAATTGAAGA CTTCCATGCC CTCTTTAGCC ACCAATGGCT GAAAGAAAAC 6600 

AAGGTCTTTG GTTTGGATAC AGTTGACATC CGTATGGGCG GACTCTTGCA ACGCATCAAA 66 60 

CGAGCAGAAA GCCGTATCGA GGTTTATCTG GCTGGTCAGC TTGACCGCAT CGACGAGCTG 67 2 0 

GAAGTTGAAA TCCTACCATT TACTGACTTC TACGCAGACA AGGATTTCGC AGCAACTACA 67 8 0 

GCCAACCAGT GG CAT AC CAT TGCGACAGCG TCGACGATTT ATACGACTTA ATATTCTTCG 6840 

AAAATCTCTT CAAACCACGT CAGCTTCCAT CTGCAACCTC AAAACAGTGT TTTGAGCAAC 6900 

CTGCAGCTAG CTTCCTAGTT TGCTCTTTGA TTTTCATTGA GTATAAAAAC AAGAACACCT 6960 

TGCTTGGCGC AGGGTGTTTC GCGTGAAACA GAAGAATTAT CTGGTTTCAA ATGCTACAGT 7 02 0 

TAGACAAACT TATGATAAAA TAGCAGAAAG TGAATGTTTC CTAAGAGCAA TTGGAGGTAT 7 080 

T AT GCT AC AC TTAAAATTAG TAAAACAAGA AATAGAAGCT GAAAAGCCAG CATCTGTAGA 714 0 

AGCTTGGATC ATTTCCGTCA AATTTAAAAA AGGTTGCTAC CGACATATAT AGATTCCAAA 7200 

AACAAAAACG TTAGCGGAAC TAGCAGATGT GATTTTATGG AGTTTTGATT TTGCAAATGA 72 60 

TCATGCTCAC GCATTTTTCA TGGATAATGT TGAGTGGAGT C AT GC AG AT T CTTACTTTCG 7320 

TAGCTTTGTT AGTG AC GATG TTGAAGAACG TTACACAGAA AATGTCTATC TGGATAGCCT 73 80 



WO 98/18931 



PCT/US97/19588 



416 

AAGTGTCAAA CAAAAATTTA AGTTTATTTT CGACTTCGGT GATGAATGGC GTTTTGAATG 744 0 

CCAAGTGCTG AGAGAAATCG AGACAGAGGA CGAAGAAGCT TATCTCGTAC GTTCGGTTGG 75 00 

AACGTCGCCA GAACAATATC CAGATTATGA TGGTTTTGAC TATGAAGAAT GGTAAAATTG 7 5 60 

AAATCAGTCT GTGTAGGCTT AGTATTTCAA TAG ACT T C C T G C AAAACT AG AATCCTAGTT 7 62 0 

CATGATTGAT AATACCAGCA ATCAAATTCA TTCGTAATCC GAAGCGTTTA CGATGATTTC 7 680 

GATAGGTTGT TGAAAACATT TTAAACGTTT TTACTTTGGC AAAGATGTTC TCAACCTTGC 7740 

TTCTCTCCTT AGATAGCGCA TGGTTATAGG CTTTATCTTC AGCTGTTAGT GGCTTGAGTT 7 800 

TGCTGGATTT ACGTGAAGTT TGTGCTTGAG GACATATCTT CATGAGCCCT TGATAACCAC 7 860 

TGTCAGCCAA GATTTTACCA GCTTGTCCGA TATTTCTGCA ACTCATTTTG AACAACTTCA 792 0 

TATCATGACA ATAGTTCACA GTGATATCCA AAGAAACAAT TCTCCCTTGA CTTGTGACAA 7 9 80 

TCGCTTGAGC CTTCATAGCG TGAAATTTCT TTTTACCAGA ATCATTCGCT AATTCTTTTT 8 04 0 

TTAGGGCGAT TGATTTTTAC TTCCGTCGCA TCAATCATTA CCGTGTCCTC AGAACTAAGA 8100 

GGAGTTCTTG AAATCGTAAC ACCACTTTGA ACAAGAGTTA CTTCAACCCA TTGGCTCCGA 8160 

CGGATTAAGT TGCTTTCGTG AATACCAAAA TCAGCCGCAA TTTCTTCATA AGTGCGGTAT 8220 

TCTAGGCTTA ATTTAGGTTT TCGTCCACCT TTTGCGTGTT TAAGTTGATA AGCTGTTTTT 82 80 

AATACAGCTA ACATCTCTTT AAAAGTCGTG CGCTGAACAC CAACAAGACG CTTAAATCGT 8340 

GTATCAGTTA ATTGTTTACT TGCTTCATAA TTTCGCAGGG AGTCTATTGA CTCTTTGGTA 8400 

GGTGTCAATG TTTTTTTCAT CTATCCCGAG AATTATTTTC CCGCCATTTG T ATTTG C AAA 84 60 

TGCTGAGTAG GTTTCCCAGA AAGACTCTGG AAGATTGTTT TTAGCTTTTT TGTATTCTAA 852 0 

ATCAACCCCT TCAAATTTTA AGTCCATATT TTTCCTTTAC ATCTGTTTTT TGTGGTTCTG 8580 

GTATTTGTTC AAGTTGAGTG ATAATATAGC GAATTGAATT TCGAGAGTTT TTACTCAGTT 8640 

AATTTCTTTT TTAACCC 8657 
(2) INFORMATION FOR SEQ ID NO : 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11384 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 45: 

TCTATTTTGG GTATAGACTT AC CT AT AAAG AAAAATATCT ATACACTGCC TTACTAGCTA 60 

TACTGAACGA GTCAACAAAA ACGATATATA TTGATGATAT AAATACAGCA AGATTTTTTA 12 0 
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ACTTCTTTGG 
CAGAAATTGT 
TGGAACATAT 
TAAATATTTT 
TGGTTGATCT 
AAGGTACAAA 
TGCTAGTGTT 
AGCAACCACT 
TAATGGCTAT 
TTGGTGGGAT 
GGTTAGAGTG 
TATTCTTCAA 
ATTTGCTAAC 
TATCCAGGTA 
AGAAAAGGCT 
AGATGCTCTG 
ATTGGCCACT 
AGAAAGCGGT 
TAAATCAAGC 
ACAATACAAA 
ATTCTCTGTA 
GCATTTTGAT 
TACTATGATG 
TAATTTAATG 
CGTCTCAAAT 
GGCTACCCTC 
AAAGCACTAA 
CATAAAGAGA 
CACCAAGTTG 



CAATGATATT CCTAATTCGT 
TTCATTTAGT ACGTACGTTC 
ATTAAAATCA AGTTTTTTAT 
AGAAGATCAA TTAACAAAAC 
CATGTATAAA TACCTAACAA 
TACATGAATA TCAAAGAAAA 
TATCTAGGCG TTGACCAACT 
AAAAAGGGCG TTAAAGTAAA 
ACAGTTAAAG ACAAGCCGAC 
AGTTACAAGA ATACAGTTAA 
CATTTATTGC CTGTATTTGG 
CAGCAAGTAA ACAAATGGGC 
TACTCTTTGC TCCATAACAT 
ATACAATACA ACCCAGCTAA 
GCTGTCAAAT ACTTAGACAA 
GATCAATCAA ATTATGAGAA 
GGTTGCCGTA TTAGTGAGGC 
GTTATCAGCA TCAATAAGAC 
GCTGGTTATC GTGATATACC 
AACCGTCAAC AAATTCAGTC 
TTTACGGAGA AATATGCTTA 
GCTGCTGGAG TAACTAACGT 
CTCTATGCTC AGGTTAGCCC 
AT C ACTGAAA ATACTTACTG 
TATGAAACAG CTATCAACAA 
TTACTATACC AAAAATTAGT 
GGGAAAGCGC CCCAAAGTGC 
TTATTTTTTA AGGTTGTAGA 
GTCTTCGATA CGAAGCAATT 



417 

CTTTAAAAAA AATTGACTAT 
GACAACGTTC TAAAGTAATT 
TAGAGAATAT AGATGTTTCT 
ATAGAACAAT CAAAATTAGT 
AACCACGCGC CTTGCCTGCT 
AATCAAAAAG AATGGCCAAA 
AACGGGCAAA AAAGCCCGTA 
AGCGCGTGAT GCGATCAATA 
AATTACAACA TATAATGAGC 
G C C AAAT ACT CGCCAATCCA 
CGATTACAAG CTATCTAAAC 
TGACAAGGCA AATAAAGGCG 
GAATAAGCGT ATTTTGAAAT 
TGATGTCATC GTTCCACGCA 
CAAAGAATTA AAACAGTTTC 
CTTATTTGAT GTTGTTCTGT 
TCTGGCTCTT GAATGGTCTG 
ACTAAACCGC T AT C AGG AAA 
AATAGACAAA GCCACATTAC 
TTGGAAATTA GGCCGATCTG 
TGCTTGTAAC TTACGCAAAC 
AT C ATT T CAT GGTTTCCGCC 
GAAAGATGTT CAGTATAGAT 
GCATACTAAC CAAGAGAATG 
TTTATAAAAA ATAAGGGTGA 
AGGGGTAGTA AAAAGGGTAT 
TTATTTCAAA GGCTTTATAG 
ATGATTTCAA T C C AC GAT AT 
GGTTGTATTT AGCGATGCGG 



ATCGCACCTT 180 

CCTAAAATTT 240 

GGTTACACTG 3 00 

AAAAACTAAC 3 60 

GATGGAAAGA 42 0 

GAGTTTATTA 4 8 0 

CAACTGTTAC 540 

CTTTTGCTGC 600 

TTGTAAAAGT 660 

TGGAGGGATT 72 0 

TTACTACGCC 7 80 

AAAAAGGGGC 84 0 

ATGGCGTAGC 9 00 

AACAGCAAAA 9 6 0 

TTGATTATTT 102 0 

ATAAGACTTT 1080 

ATATTGACCT 1140 

TAAACTCACC 12 00 

TTTTACTGAA 12 60 

AAACAGTTGT 13 2 0 

GCCTAAATAA 13 80 

ATACACATAC 1440 

TAGGCCACTC 1500 

CAAAAAAAGC 15 60 

CCCATTTCCG 162 0 

TAAATTATAA 16 80 

CCTATAATCA 1740 

TCAGCTACTT 1800 

TCTGTACGTG 1860 
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AAAGTGAACC AGTCTTGATT TGTCCTGCGT TAGTTGCAAC TGCAATATCA GCGATTGTTG 1920 

AATCTTCAGT TTCACCTGAA CGGTGTGATA CAACAGCAGT GTAACCAGCT TCTTTAGCCA 1980 

TTTCGATAGC TTCAAAAGTT TCAGTAAGAG TACCGATTTG GTTAACTTTG ATAAGGATTG 2 04 0 

AGTTAGCAGC ACCTTCTTGG AT AC C ACGT G CAAGGTAGTC AGTGTTTGTT ACGAAGAAGT 2100 

CGTCACCAAC AAGTTGTACT TTCTTACCAA GACGTTCAGT AAGAGCTTTC CAACCATCCC 2160 

AGTCGTTTTC ATCCATACCA TCTTCAATAG TGATGATTGG GTATTTGTTA ACCAATTCTT 2220 

CAAGGTAGTC GATTTGTTCT GCAGATGTAC GAACAGCAGC ACCTTCACCT T C AAAT T TAG 2280 

TGTAGTCGTA AACTTTACGT TCTTTATCGT AGAATTCTGA TGAAGCACAG TC AAAT C C G A 23 4 0 

T AAAT ACGT C TTTACCTGGT AC AT AT C C AG CAGCTTCAAT CGCAGCAAGG ATAGTTTCAA 2400 

CACCATCTTC AGTTCCTTCG AAACGAGGAG CGAATCCACC TTCGTCACCT ACGGCAGTTT 24 60 

CCAAACCACG TGATTTAAGG ATTTTCTTAA GAGCGTGGAA GATTTCAGCA CCGTAACGAA 2520 

GGGCTTCTTT AAATGTTGGC GCACCAACTG G C AAG AT CAT GAACTCTTGG AAAGCGATTG 2580 

G AG CGTC AG A GTGAGAACCA CCGTTGATGA TGTTCATCAT TGGAGTTGGA AGAACTTTAG 2 640 

TGTTGAATCC ACCAAGATAG CTGTAAAGTG GGATTTCAAG GTAGTCAGCA GCAGCACGAG 27 00 

C T AC AGCG AT AG AC AC AC C G AGGATTGCAT TCGCACCCAA TTTACCTTTG TTAGGAGTAC 27 60 

CGTCAAGTGC G ATC AT AG C A CGGTCAATAG CTTGTTGATC ACGTACATCG TAG C C AATG A 2 82 0 

TAGCTTCAGC AATGATGTTG TTTACGTTGT CAACAGCTTT TTGTGTACCA AG ACC AC C G T 2 8 80 

AACGAGATTT GTCACCGTCG CGAAGTTCAA CTGCTTCGTG TTCACCAGTA GAAGCTCCTG 2 940 

ATGGAAC CAT ACCACGTCCG AAAGCACCTG ATTCAGTGTA AACTTCTACT TCAAGTGTTG 3 000 

GGTTACCGCG TGAGTCTAGG ACTTCGCGAG CGTAAACATC AGTAATAATT GACATTTTTT 3060 

ACTCTCCTTA TGAGTTAAAT TTTTTACACC TCTATAATAC CTTAAAACCC CTCCTTTTTC 3120 

AAGAAAAAAC GTTATCTTTG TGCAACTTTT CCTTAACTTT ATAAAGTAAT CGCTTTCTTT 3180 

TGTCTGTTTT ATTCTAACTT T T ATG AT AT A CTGTTTTCAT GACAGATTTA TCAAAACAAT 324 0 

T ACT TG AAAA AGCTCATGGT GGGTTAAAAA TAAATCCGGA TGAGCAAAGA CGCTATCTTG 33 00 

GTACTTTTGA GGAAAGAGTT CTTGGATATG TAGATATTGA CACAGCAAAT AGCCCTCAGT 3 3 60 

TAGAAAAAGG CTTTTTATTT ATTTTAGAAA ACCTTCAGGA AAAAGCAGAG C C ACT ATT T G 342 0 

TGAAGATTTC AC C AAC T ATC GAATTTGATA AGCAAGTTTT CTACTTAAAA GAAGCAAAAG 3480 

AAACTGATAG TCAAGCCACC ATAGTATCTG AAGAGCATAT TACTTCTCCT TTTGGCCTGG 3 540 

TTATTCATAG CAATGCACCA GTTCAAGTAG AAGAAAAAGA CCTTCGACTT GCTTTTCCAA 3 600 

AACTTTGGGA AGTTAAAAAG G AAGAAC C AG CCAAAACATC CTTATGGAAG AAATGGTTTA 3 660 
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GCTAAATCTT G C AC AT AT TT AATAAGTGCC CAATATTGGC AGCCGTGCGC TCCAGATAGA 372 0 

AACTGGCATT TTTCAAACTA TCTTCTAAAG GTTCACTTTT CTCCAAAATA GAAAAGACAG 3 7 80 

CTTGGATATT TT C AAATGGT AGGGGAGGTA AATCTTCAGC AAGACTACCG CAAATAGCAA 3 840 

TAACAGGAAC TCCAACAGGG GTTCTTTTTG CAACACCTAT AGGCGCTTTC CCAGCAAAGC 3 900 

TTTGACTATC AAGTCTTCCT TCTCCAACAA CAACCAAGTC AGCATCTGAA ACTTTCTTAT 3 9 60 

CAAAGTTGAT TAAGTCCAAG CAGGTATCAA TTCCAGACAC GATACTTGCC TGAGCAAAGG 4 02 0 

CACACAAACC ACCAGCAAGG CCTCCACCTG CTCCTGCTCC TTTAATTTCT AATGTTGCAG 4080 

GTGAGAATTT TTCATAAAAA TCTTGGATCG CCTGATCTAC GACTGCAAAC ATAGTCGGAT 414 0 

GTAGACCTTT TTGATTGCCA AAAGTGTAAG TCGCACCTTG ATGACCACAT AAGGGACTCA 4200 

CGACATCTGC TAAAATATGA ATTTGAACAC CTTCAGGAAT TTTATAGCAA TTTTCTGTTG 42 60 

AAACAGAAGC TAAGTTTAAT AAGGATTGAC CGGAAGCAGG CAAGACATTT CCATCCCTAT 4 32 0 

CATAAAATTG AT AAC CT AAA CCAGCAGCAA TCCCCAGTCC TCCATCATTA CTGGCCGTGC 43 80 

CACCAACACC GATATAAATA TCTTTAATCC C T T TAG AG AT GAGATGAAGA ATCAACTCTC 4440 

CAATACCACA AGTTTGGATT TGAAGTGGAT TTCGTTTCTC TAGCGGAATT TTTCCAAGAC 4 500 

CAACCAAGTC AGCTACTTCA AATAGTGCCA GTTCCCCTTT TTGAAAATAG CGCATGGCTT 4560 

CTTTTTGTCC AAAAGGGTCT GTCACTTGGA TCCATTTTTC TTTTAGGTCA AG AG AATGT C 4 62 0 

GGATAGCATC TACAGTACCT TCTCCCCCAT CACCAACAGG G C AG AGG AG A CATTCTACAT 4 680 

CTGCTATCGA TTGTTGGAAG CCTCTTTTTA TTGCTTCAGC TACCTGTTGA GCTGTCAAGC 4 74 0 

TTTCCTTAAA CGAATCCGGT GCAATTACAA TCTTCATATT TTCCCTCATT CTAAACAGTC 4 800 

AATCAAAGGG AGAACTTCTA AAAAATCCCT CTTGTCAACA TGATGTGGTA TTTCTTTTTT 4 860 

GAGCACTTCT T TGGC AC AAA AGGCGATTCC TAACTTCGCC G ACT T C AAC A TTAATAGATT 4 92 0 

ATTAACCCCA T C AC CG AT T G CCACCGTTCT TTCTTTAGAA AGTTTTAGTT TCTTTCTCCA 4 980 

TTTTTCCAGA GTCTCTTTTT TGACCTGGGG ACTTATAATT TGTCCAACTA ATTTTCCTGT 504 0 

TAAAAGACCT TCTTTGACTT CAAGCTAGTT GGCAGTGAAA TAGGCAATAC C AAGGG AT TT 5100 

TGCTAATCTC TCCAACTATT GGTGTAAATC CACCAGACAC CAGACCAACT AGGATGCCAT 5160 

TCTTTTGGAG AATAGAGATG AAC TCTGGG A CATTTAGCGA TAGATGAATT GAGTTGAAGA 5220 

CGTTATCAAA G AC C AAAAT A GGAAGACCTT CCAACAAGGA CACTCTTTTT CTTAAACTGC 52 80 

TTTCAAAGAC CAACTCTCCT CGCATTGCTC GACTTGTAAT CTGCGAAATT TCCGCCTCAT 534 0 

GACCTGCCTC TCTCCCTAAA AGATCAATCA CTTCTTCTAG GATTAAGGTT C CATC T AC AT 5400 
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CCAAAACACA CAAGCCTTTT ACTTGAGACA TCAGTTCTCC TCTCTAAACA GCCTAAAAAT 5460 

CGTATGAAGT CATCATACGA TTTTATCTAT TAATTAACTA AACTATGGTA CAAGTCAAGG 552 0 

TATGACTTGC AGGCTGTATC C C AT G AGAAG TCACTCTCCA TAGCTTGTTT TTGTAGGTTT 5580 

CTCCAAATGT CTGGATGGTT TCTATACAAG TCCAATGCTG TTTGGAAAGT CCAATTTAAC 564 0 

CAATAAGGAG ATAGATTGTC AAAGCTAAAG CCAGTACCGC TTCCTTCGAT TGGATTGAAA 5700 

GCGCGAACTG TATCTCGCAA GCCTCCAACT TCATGGACCA ATGGCAAGGT TCCATAACGC 57 60 

ATAGCCATCA TTTGAGACAA GCCACACGGT T C AAAACG AC TTGGCATGAG GAAGAGGTCA 582 0 

CAAGCAGCGT AGATTTCCTG AGCAAGTTTG ACATCAAAAG TGATATTTGT TGATAGCTTG 5880 

TCTGGGTAAA TCTGAGCAAA C C ATGAG AAA GCTCCTTCAA AGGCTGGATC GCCAGTTCCC 5940 

AAAAGAACAA TCTGAACATC TTCTTGCAAG ATATGGTGAA GACTTTCGAC CACCACATCA 6000 

AAACCTTTTT GACGTGTCAA AC G AG AAAC A ATTCCCACCA GTGGAACGTC TGCTCTAACA 6060 

GGCAAGCCAA CTCTTTCTTG CAATTTTGCC TTATTTTTGG CTTTCCCAGA CAAATCTTCC 612 0 

TGATTGAAAT GATAGTCTAA AAG AG CAT C C GTCTGAGGAT TATAAAGATC AGCATCAATC 6180 

CCATTCACGA T AC C AG AT AC TTTACCAGAC TCCATTTTAA GAATCTGATC CAAATTACAT 624 0 

CCAAACTGAC TAGTCATAAT TTCATGAGCA TAGCTAGGTG AAACGGTTGA AACACGGTTC 6300 

GCATAGAGAA TACCTGCCTT CATCCAGTTC AGACAGTTGT T CC AT CG AAG GGTGCCATCA 63 60 

GCGTAACGTT CAAAGCCAAC TC C AAAC AAA TCACCCAACA TTCCTTCTGA AAATTGTCCT 642 0 

TGGAATTCTA AATTATGAAT GGTTAAAACT GTTTCAATGT CCTCATAGGC TTGAATCCAA 64 80 

CGGTATTTTT CCTTCAACAA GAAAGGAATC ATAGCTGTAT GGTAGTCATG AACATGGAGA 654 0 

AGATCAGGAA TAAAGTCAAT CCTTTCCATA GCCTCAATGG CAGCCAGTTG GAAAAAGGCA 6 600 

AAGCGTTCTC CGT CATC AAA ATCACCGTAA ACATGACCAC GGAAGAAATA ATATTGATTG 6660 

TCAATAAAGT AGAAGGTTAC ACCATTTAAT ACTGTTTTCT TAATTCCACA ATACTGTCTG 67 2 0 

CGCCAACCAA CGCTCACCTC AAAATG AAG C ACATCTTCAA TCTGATTTCC AAATTTAGCC 67 80 

TCTACCATAT CATAGTAGGG TAAAATCACT GCAACTTCGT GCCCAGCTTT TACCAGTGAT ii84 0 

TTTGGAAGAG CGCCAATGAC GTCTCCCAAA CCACCTGTTT TTGAAAAGGG TGCACCCTCT 6 900 

GCTGCTACAA ATAAAATTTT CATGAATGAA TATCCTCTGT TACTTTAGCA CCTTTCTTAA 6960 

CCACAACTGG ATGTTCTGCA GTTCCTCGAA TCACAACACC ATGCTCAACT TCAACCCCTT 7020 

TGTCCAAGAT AGCATATTCG ACCTGAGCCC CTTCTCCAAT AACAACACGA GGGAAGAGCA 7 080 

GGCTATCTTT AACCAAGCTA TCCTTATGGA CATGAATATT ACGTGATAGA ACAGAATTAG 714 0 

CTACTTGACC TTCAATAATA CTACCAGAGG CAAACTGAGA AGTGCTTACC TTAGATGTAT 72 00 
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TAGCATAGTA AGTTGGCTCT TCGTTTTTGA CCTTTGTATA AATCTTTTGG TTTGGTGAGA 72 6 0 

AAAGAGAATA GAATTTTTGT GATTCAAGCA TATCGATATT CGCTTGATAA TAAGATTTAA 73 2 0 

CAGAGTGAAT ATTGGCTAGA TAGCCCGTGT ACTCGTAGGC GAAAGCTCCC TCTTTTACAG 73 8 0 

CCAAATCCCG TAAAACATAG CGCAATTTCT CTGGATGTTC TTTTTTAGCT TCTTCTTCCA 7440 

AGTGTTCAAT CAACCAAGGT GTATCAACGA C AAAG AT AT C TGTAGACATA TTGAACGTTT 7 500 

CAGCTGTTGA CTTGCTATCA AAGAGTTTAT GAGAAAGAAC ATGGTCTGTT TCATCTACAT 7 560 

CCAAGATTGC ATTTACTTCT GAAATATCTT TCTTAGCTAG TTTTTTATAA ACTACAGTGA 7 62 0 

TAGGCTCTTT TGT TGT ACT A TGTAGGTGGA AAACTTGGTT CAAATCAATG TTAATAAGAA 7 680 

CATCGCAGTT GAGGGCAACC GTTTGGTTTG AGCCAGAACG TTTCAAATAA GTAAGAAGCT 774 0 

GTTGGTAGTA TTCTTTTCCA ACTGTACTAC TTTCTACACG GGTATTGTAA ATTCCTAGAT 7 8 00 

AGTAATGGCT AAGAAGGGTT GATAAGCCCC ACTCGCGTCC TGAACGAATA TGGTCAAATA 7 8 60 

CTGAGCTGAT ATTATCCTGC TGGAAAATAC CAAAGACACT ACGAACACCT GCATTAGCAA 7 92 0 

GGCTTGAAAG TGGGAAGTCA ATCAAACGAT ATTTCCCACC AAATGGCAAA CTTGCTACTG 7980 

GACGGTGGTC CGTCAATGTC GACATATTGT GAAAACCAAC TGTATTTCCT AAAATGGCAG 8040 

AATATTTATC AATCTTCATC TGTTGCTACC CCCACTACTT CATTATATCC TACAACTTGT 8100 

ACTTCATCTG TTCCATCAAT TTCGACACCG TCAGAAATAA TCGCACCTTC ACCAATAATG 8160 

GCACGTTTAA TCTTAGCTCC TTGACCAATG ATAGCTCCAC TCATGATAAC TGAATCAAGG 822 0 

ACTTCCGCTC CTTCGCGAAC TTGCGCGCCT GTTGAAAGGA TAGAATGTTT AACAGTTCCA 82 80 

TCAACGAAAC ATCCGTCTAC AACTAATGAG TCTTCCACAT GAG CAT T T GC CCCGAGGAAG 8340 

TTTGGTGGTG AAATCAAGTT TCTTGAGTAA AT C TT C C ATT GACGGTTACG ACT AT C C AAG 84 00 

GCATTTTCTG GAGAAATATA CTCCATGTTC GCTTCCCAAA GTGACTCAAT AGTACCAACA 84 60 

TCTTTCCAAT AAC C ACT AAA TTCGTAAGCA TAAACACTTT CACCTGACTC AAGGTAATTT 8 52 0 

GGAATGACAT TTTTACCAAA GTCTGACATG CCAACCTTGC TCTTTTCAGC AG CG AC T AAC 8580 

ATATTACGAA GGCGTTGCCA ATCAAAAATG TAGATTCCCA TAGAAGCTTT TGT AG AT TT A 8 640 

GGTTGAGCTG GTTTTTCTTC AAATTCAACA ATACGATTGT TAGCATCTGT GTTCATGATA 87 00 

CCAAAACGGC TTGCTTCTTT AAGAGGGACG TCTAAAACTG CTACTGTCAA GCTGGCATTA 87 60 

TTATCCTTAT GAGACTGGAG CAT AT CATC A TAGTCCATTT TGTAGATGTG ATCCCCAGAC 8 82 0 

AAAATCAAGA CATACTCAGG ATTGACACTG TCGATATAGT CGATATTTTG GTAAATAGCG 8 8 80 

TGACTAGTCC CCTCAAACCA ACGATTTCCT TCACTTGCAG AATAAGGTTG AAGAATAGAG 894 0 
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ACACCTGAAT TAATACCGTC TAGTCCCCAG CTTGAACCAT TCCCAATATG GTTGTTGAGA 9000 

GCAAGTGGTT GATACTGTGT AACGACCCCA ACATTGTGAA TCCCTGAGTT GGCACAGTTT 9 0 60 

GATAGGGCAA AGTCAATGAT ACGGTAGCGC CCACCAAATT GCACAGCTGG TTTTGCGATG 912 0 

CTTTGAGTGA GTTTACCGAG ACGAGTTCCT TGCCCACCAG CAAGAATCAA AGCTAACATT 9180 

TCATTTTTCA TTTTCTACTC CTTTTTGGTT TTTATTTGTG ACGGTTTTAG TAGATTTCAA 92 40 

GCGACGTTTG ATTTTCCATA CACTTGCTCC CATAGCCGGT AGGGTAAAGG TTAAGGTCTG 9300 

CTCATAATCT TTCCATAGTC CTTCTTGCGT TTGAACAGTT TGATTATGTT CTTTCCAAAC 9 3 60 

GCCTCCCCAC TCTTCCAACT CAGTATTCCA TACTTCTTCG TAAATTCCTG CAACGGGTAG 942 0 

TCCGATTGTA AAATCTTTCC GCTCAACAGG TACCATATTA AAGATACAGA CTAACATTTC 9480 

TCCCTTTTTA CCCTTACGAA TAAAGGAAAG AACACTCTGG TCTCGATTAT CCGCATCAAT 9540 

GATTTCAATA C CAT C AT AGC TGGTATCAAT TTCCCACAGA C AG C GAT GAT CTTTGTAAAA 9600 

CTGGTTTAGC TGAGAAGCGA AATACTTCAT CTTAGCATTC ATTGGGTCTT CTAGGTTAGA 9 6 60 

CCATTCCAAC TGTTCTTCAG ATTTCCATTC TAGGAATTGA CCGTATTCGC TACCCATGAA 972 0 

GAGCAATTTC TTACCAGGGT GACAAATTTG GTACGTATAG AGATTGCGCA AGCCTGCGAA 9780 

TTGATTGTAA CGATCTCCCC ACATCTTATG CAT CAT ACT C TTCTTGCCAT GAACCACTTC 9840 

ATCGTGCGAG AATGGCAAGA GATAATTCTC CTTGAAAACA T AC AT AAAG C TGAAAGTCAC 9900 

CAGGTTAAAG T C AT ATTT AC GATAGATCGG ATCTTCTTCG TAGAAACGGA GGATATCATT 9960 

CATCCAGCCC ATGTTCCATT TGTAGTCAAA TCCTAGACCA CCAATCTCTT TCATTCCCGT 1002 0 

AATCTTGATC GCAGACGAAC TTTCTTCTGC AATCATCATC AC AT C TGG AT ATTCTAACTT 10080 

AATAACCTCA TTCAAGCGCT GAAGGAAATA ATAACCTTCA TAGTTGAGAT TTCCGCCATC 10140 

TTTATTAGGT GTCCATGGAG CATCATCATA GTCCAAATAG AGCATGTTGC TAACAGCATC 10200 

CACACGAATA CCATCCAAAT G AT AGAC AT C AATCCAATGC TTAATGCAAG AAATTAAGAA 10260 

GGACTGGACT TCATTTTTTC C AAGGT C AAA ATTAAGGGCA CCCCAACCAT GGTTATGAGC 10320 

CTTATTATGG TCTTGGTATT CAAAAGTCGG TGTCCCATCA TAATAGGCTA AGGCATCATC 103 8 0 

GTTGATGGTA AAGTGACTGG TACCCAGTCC ACAATAACCC CAATATTATG GGTATGACAC 10440 

TCCTCGACAA AATCTTGAAA CTCCTCTGGT CGGCCATAAG CATGCTCTAA AGCGAAGTAA 10500 

CCCATAAGCT GATACCCCCA ACTCAAGCCC AAAGGATGGG AC AT C AAGGG CATAAACTCA 105 60 

ATATGAGTAT AGTTCATTTC AACGAGATAA GGAATGAGTT CATCCTTGAG CTGGGCAAAA 10620 

C T AT AAGGAC TGCCATCAGA ATTTCTTTTC CATGATCCAG CGTGAACTTC ATAAATATTG 10680 

ACAGGACGCT CTTCAAAGCC CCAACGTTTT CTTCGTGCCA GCCAAAGTCC ATCCTTCCAT 10740 
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TTCTTCTCAG GAAGCTCTGT TACGATTGCC CCTGTTCCTG GACGAGCCTC ATACCTGACA 10 80 0 

GCAAAAGGGT CAATCTTCAT CAGTTGATGA CCATTTTGAC GTGTGACATG ATATTTGTAA 108 6 0 

ATATGCCCTT CTTGAGCCAT ATTGGTAAAG ACTTCCCAGA CCCCAAAATC ATTTCTTACC 1092 0 

ATTGGAATCT GATTTTCAAT CCAGTTGGTA AAATCACCAA CCAAGTGAAC AGCCTGAGCA 109 80 

TTAGGTGCCC AAACACGGAA GGTATAGCCA TGCTCTCCAT TTAGTTCTTC CCTATGTGCT 11040 

CCTAGATAAT GTTGGAGATA AAAATTTTCA CCCGTCATAA AGGTTTTTAA TGCTTCTCTA 11100 

TTATCCATAT ACTCCCCTTC TCCTGTAAGC GTTTTCTATG T TT T T ATT AT ACTACCTTTT 11160 

TAGAGAAGAT TCAAGTAAAT T AC TAT AC T T CTTTAATTAT TTTGAAAATC TACAACAAGT 112 2 0 

TCACTTACTC GTTCAATTGT AAATCAATAT TTTTTCAAAA AATTGCGAAA ACGCCTTTCT 112 80 

TTTTCTACTA TAGTGAAATG AAATAAAACA TGCGCAAATC GATTAAGGAA TTTAATCTAA 11340 

TTTCTAACAA TGTCTTAGAA ATCAAAGTGT ACTATTTTAA CTCC 113 84 
(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7577 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

TGTTGATTTG TTACTAGACG TTGACCAACG TCCTTCGGCT GGAAAAGGAA TTCTCCTTAG 6 0 

TTTCCAACAC GTTTTCGCCA TGTTTGGTGC GACCATCTTG GTACCATTGA TTTTGGGAAT 12 0 

GCCTGTATCT GTTGCCCTTT TTGCTTCAGG TGTTGGAACA CTCATCTACA TGATTGCTAC 180 

TGGTTTTAAA GTTCCAGTTT ATCTAGGTTC TTCATTTGCC TTTATCACAG CTATGTCACT 2 40 

GGCTATGAAA GAAATGGGGG GGGATGTATC TGCTGCCCAA ACAGGGGTTA TCTTGACTGG 3 00 

TTTGGTCTAT GTCCTTGTTG CTACCAGCAT CCGATTTGTA GGAACAAAAT GGATTGATAA 3 60 

ACTCTTGCCA CCAATCATTA TCGGTCCTAT GATCATCGTT ATCGGTCTTG GACTTGCAGG 42 0 

TTCAGCTGTT AC C AATG C AG GTCTTGTAGC AGACGGAAAT TGGAAAAATG CTCTGGTAGC 480 

CGTTGTTACT TTCCTAATTG CTGCCTTTAT CAATACAAAA GGAAAAGGCT TCCTACGAAT 540 

CATTCCATTC CTCTTTGCCA TTATCGGTGG TTACCTTTTC GCACTAACTC TTGGCTTGGT 600 

TGACTTTACA CCAGTTCTTA AAGCCAACTG GTTCGAAATT CCTGGTTTCT ACTTGCCATT 660 

TAGCACAGGT GGTGCCTTTA AAGAGTACAA TCTTTACTTT GGTCCAGAAG CCATCGCTAT 72 0 
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CTTGCCAATC GCTATCGTAA CAATTTCTGA ACATATCGGA G AC C AT ACTG TTTTGGGTCA 780 

AATCTGTGGT CGTCAATTCT TAAAAGAACC AGGTCTTCAC CGTACTCTTC TTGGTGACGG 840 

TATCGCAACT TCTGTTTCTG CCTTCCTTGG TGGACCAGCC AATACAACTT AC G G AG AAAA 900 

TACAGGGGTT AT C GGT AT G A CTCGTATCGC TTCTGTCTCA GTTATCCGTA ACGCTGCCTT 9 60 

CATCGCGATT GCCCTCAGCT TCCTTGGTAA ATTCACTGCC TTGATTTCAA CTATTCCAAA 1020 

CGCTGTACTT GGTGGTATGT CAATCCTTCT CTATGGGGTT AT CGCC AG C A ATGGTTTGAA 10 8 0 

AGTCTTGATT AAAGAACGTG TTGATTTCGC TCAAATGCGA AACCTCATCA TCGCAAGTGC 1140 

TATGTTGGTT CTTGGACTTG GAGGAGCTAT CCTTAAACTT GGTCCAGTTA CACTTTCAGG 12 00 

TACTGCCCTT TCAGCCATGA CAGGAATCAT CTTGAACTTG ATCTTGCCAT ACGAAAATAA 12 60 

AGACTAAGAG TCTAAATACA CCTAATCCAC TCAGACAGCT GAGTGGATTT TTCGTATACC 132 0 

ATAATAAAAG TGTCTTAACA AAATTATTAA AATCAAAAAA CGTATAATAT CAGATATTCT 13 80 

AAAACCTTGA TACTGTACGT TTTATCATAG AAATTTTTAC TTTATTTTCT CAT C AAATG A 144 0 

GATTTGCATC AATCTCTTGT CTTACTTGCG TTTCTTCTTC GCTTTCTTCA TTTTGTTAGC 1500 

CATACGTTTC ATGGACTGTT T C ATGG C AAA TTCACCAATT TTACCTTTCA AACCGCCACC 1560 

AAACATCTGG CTCATATCTG GCATTCCTGC TCCTCCGAGA GCTGATAAGT C AG G CAT AC C 162 0 

GCCTTGTCCC ATCATTCCTT CAAGGGCAGA CATATCCATT CCTCCCATAT TTGGCATATT 1680 

TTTAGGAAGG TTATTTGGAT TAATCCCCAT TTGCTTCATC ATTTTATTCA TATCCCCAGA 1740 

CATAACACCC TGCATGAGCT GTTTAGCCTG GTTAAAGTCC TTGATGAATT TATTGACTTC 1800 

GACGAATGTA TTTCCAGAAC CAGCAGCAAT ACGACGGCGA CGGCTTGGAT TTAACAAATC 18 60 

TGGGTTTTCA CGCTCTTCAG GTGTCATCGA AGACACAATG GCACGTTTAC GAGCAATCTG 192 0 

GCGTTCATCC ACCTTCATGT TTTGAAGGGC TGGATTGTTG GCCATACCTG GAATCATCTT 1980 

GAGCAAGTCT TCCATCGGCC CCATATTTTG CACCTGATCT AATTGATCGA TGAAATCATT 2 040 

AAAATCAAAG GTGTTTTCGC GCATCTTCTC AGCCATTTCA AGGGCTTTTT GTTCATCGTA 2100 

TTCCTGAGAA GCTTTCTCAA TCAAAGTGAG CATATCCCCC ATACCAAGGA TACGGCTAGA 2160 

CATGCGGTCT GGGTGGAAGG TTTCAATGTC CGTAATCTTT TCACCTGTAC CAGTGAACTT 2 220 

GATTGGTTTT CCAGTAATGT GACGAACAGA CAGAG C AGC A CCACCACGAG TATCGCCATC 22 8 0 

AATCTTGGTA AGGATGACCC CAGTCACTTC CAACTGAGCA TTAAACTCAC GCGCAACATT 2340 

GGCTGCTTCC TGACCAATCA TAGCATCAAC GACAAGCAAG ATTTCATTTG GTTGAGCCAA 240 0 

TGCTTTCACA TCACGAAGCT CATTCATGAG GAGCTCATCA ATCTGCAAAC GACCCGCAGT 24 60 

ATCAATCAAG ACATAGTCGT TATGATTAGT TTGGGCTTGC TCCAAACCTT GACGTACAAT 2 520 
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CTCAACAGCT 


GGTACTTCTG 


TTCCAAGTGC 


AAAGACAGGC 


AC AT C AATC T 


GTTGTCCCAA 


2580 


GGTCTTAAGC 


TGGTCAATGG 


CAGCTGGACG 


AT AAAT AT C C 


GCCGCAATCA 


TCAAAGGACG 


2640 


AGCATTTTCT 


TCTTTCTTGA 


GTTTGTTGGC 


CAATTTACCA 


GCAAAGGTTG 


TTTTACCAGC 


2700 


CCCTTGTAAA 


CCAACCATCA 


TGATGATGGT 


TGGAATCTTA 


GGTGACTTGA 


TAATTTCTGC 


2760 


CGTATCAGAA 


CCTAAAACGG 


CTGTCAATTC 


CTCATCAACG 


ATTTTAATAA 


TCTGTTGCGC 


2820 


AGGATTAAGT 


GTATCAATGA 


CCTCATGCCC 


GACTGCACGC 


TCACGAACTT 


TCTTGATAAA 


2880 


GTCCTTTACA 


ACAGGCAAGG 


CAACGTCGGC 


CTCGAGCAAG 


GCCAAGCGAA 


TTTCTTTGGT 


2940 


TGCCTCTTGG 


ACATCAGATT 


CAGAGATTTT 


TCCTTTTTTA 


CGTAGATTTT 


TAAAGACGTT 


3000 


CTGCAAACGT 


TCTGTTAAAC 


TTTCAAATGC 


CATTTTTCTT 


CCTCTTATTC 


TCTATTATCA 


3060 


ATGCTTGTTA 


AAATTTCTAT 


CTGCTCCTGC 


AGAAAGTCAT 


CCTTGGGATA 


GCGCTCCAAA 


3120 


AT C T GAT C AA 


AAATCTGACT 


GCGGACAATA 


TAGTCCGAGT 


ACATGTGCAA 


TTT CAT C TC A 


3180 


TAATCTTCCA 


GAATCTTTTC 


TGTTCGCTTG 


ATATTGTCAT 


AGACAGCCTG 


ACGACTGACA 


3240 


CCGAACTCCT 


CGGCAATTTC 


AGCAAGGCTG 


T AAT CAT C AG 


CGTAGTAGAG 


CTCGATATAA 


3300 


TTCATTTGCT 


TATCTGTCAA 


AAGCGCCGCA 


TAAAATTCAA 


AGAGCGCATT 


CAT AC GAT T G 


3360 


GTTTTTTCGA 


TTTCCATAAC 


TTTTATTATA 


CCAAAAATTA 


GCCTAATCTA 


CCACACTAGG 


3420 


AAGCCGATCC 


AAGAAGATAG 


ATAGCTAAAT 


T TG AAAAAG A 


CATGAGCCTA 


GCCCCAAGTA 


3480 


ATTTCCAATT 


GATAGCTGGC 


AAAGGG AT GT 


CCCTCTTGAT 


TTTGTAGTTG 


ATAATCTAGT 


3540 


TCAATCTTTT 


GCCTATCAAC 


TTGATAATGG 


CTCGTTTGGA 


TG AT AAACT C 


CTGCATGCCC 


3600 


ATAGGTGTAG 


GAATATAGGC 


TAAACTATCG 


CTATCCTTTA 


GAAAGCGCAT 


AATGGTCTTG 


3660 


GGATTAGAAA 


ATCGGCTCAT 


CACAAGTTCT 


TGACCATGAA 


ATTTAATCAC 


TACTTTTTCC 


3720 


TTTTCCTCAT 


TATAGAAAAG 


CAGGTAGCTA 


TAATCTCCTT 


TTTCATGCAC 


TTCCACATCA 


3780 


TAAAGCTGGT 


CAATCACTTC 


CAACTGCTCA 


TCAAACTGAA 


TCGTATTTCG 


CATCCGAATC 


3840 


T TC AC AT C AG 


GCCCTCTTTC 


TTGTCTCTTG 


TCCTACTATT 


TTACCAAAAA 


GAGCAGGATT 


3900 


TTGCTATAAT 


GGTCATATGA 


ACGAAAAAGT 


ATTCCGTGAC 


CCTGT TC AC A 


ACTACATCCA 


3960 


TGTCAATAAT 


C AAAT CAT CT 


ATGACTTGAT 


TAATACAAAA 


GAATTTCAGC 


GTTTGCGCCG 


4020 


GATCAAACAA 


CTGGGAACTT 


C C AGTT AT AC 


CTTCCACGGT 


GGAGAACACA 


GTCGCTTCTC 


4080 


TCACTGTCTA 


GGAGTCTATG 


AAATTGCACG 


ACGCATCACA 


GAGATTTTCG 


AAGAAAAATA 


4140 


TCCTGAGGAA 


TGGAATCCTG 


CCGAGTCTCT 


CTTGACCATG 


ACCGCTGCTC 


TCCTACACGA 


4200 


CCTTGGGCAT 


GGTGCCTACT 


CCCATACTTT 


TGAACATCTC 


TTTGATACAG 


ACCATGAAGC 


4260 
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C ATT AC TC AG GAG AT TAT T C AAAATC CTG A GACAGAGATT CACCAAGTCC TGCTACAAGT 4 32 0 

GGCACCTGAT TTCCCAGAAA AGGTGGCCAG TGTCATTGAC CATACCTATC CTAATAAGCA 43 80 

GGTCGTGCAG CTCATTTCTA GTCAGATTGA CGCAGATCGC ATGGACTATC TCTTGCGCGA 4440 

CTCCTATTTT ACAGGAGCAT CCTATGGGGA ATT TG AC CTG ACTCGAATCC TCCGAGTCAT 4 500 

TCGTCCTATC GAAAATGGTA TCGCCTTTCA GCGCAATGGC ATGCACGCCA TCGAAGACTA 4 5 60 

CGTCCTCAGT CGC T AC C AG A TGTACATGCA GGTTTATTTC CACCCCGCAA CACGCGCCAT 4 62 0 

GGAAGTTCTC CTACAGAATC TTCTCAAACG CGCCAAGGAA CTCTATCCTG AGGACAAGGA 468 0 

TTTCTTTGCC CGAACTTCTC CACACCTCCT GCCTTTCTTC GAAAAAAATG TGACCTTGAC 4740 

TGACTATCTG GCTCTGGATG ATGGCGTGAT GAATACCTAC TTCCAGCTTT GGATGACCAG 4800 

TCCTGACAAG ATTCTTGCAG ATT TAT CG C A TCGCTTTGTC AACCGCAAGG TCTTTAAATC 48 6 0 

CATTACCTTT TCACAAGAGG AC C AAG AT C A ACTTACTAGC ATGAGAAAAT TGGTTGAGGA 492 0 

TATCGGCTTT GATCCCGACT ACT AC ACTG C CATTCATAAG AACTTTGACC TCCCTTATGA 4 980 

TAT CT AT C GT CCCGAATCTG AAAACCCACG GACACAGATT GAG AT T TT AC AAAAAAATGG 5040 

AGAACTGGCC GAACTCTCTA GCCTGTCTCC TATCGTCCAA TCCCTTGCTG GCAGTCGCCA 5100 

CGGAGATAAT CGCTTTTATT TTCCAAAAGA AATGTTGGAC C AAAAC AG C A TCTTTGCAAG 5160 

CATTACCCAG CAATTTTTAC ACTTGATTGA GAACGATCAT TTTACCCCAA ATAAAAACTA 5220 

GAAGAGGAAA TTTATGAGTA TTAAACTAAT TGCCGTTGAT ATCGACGGAA CCCTTGTCAA 5280 

CAGCCAAAAG GAAATCACTC CTGAAGTTTT TTCTGCCATC CAAGATGCCA AAGAAGCTGG 53 4 0 

TGTCAAAGTC GTGATTGCAA CTGGCCGCCC TATCGCAGGC GTTGCCAAAC TTCTAGACGA 54 00 

CTTGCAGTTG AGAGACGAGG GGGACTATGT GGTAACCTTC AACGGTGCCC TTGTCCAAGA 5460 

AACTGCTACA GGACATGAGA TTATCAGCGA ATCCTTGACT TATGAGGATT AT CT AGAT AT 552 0 

GGAATTCCTC AGTCGCAAGC TCGGTGTCCA CATGCATGCC ATTACCAAGG ACGGTATCTA 5580 

TACTGCAAAT CGCAATATCG GAAAATACAC TGTACACGAA TCAACCCTCG TCAGCATGCC 564 0 

TATCTTCTAC CGTACCCCTG AAGAAATGGC TGGCAAAGAA ATTGTTAAAT GTATGTTTAT 57 00 

CGATGAACCA GAAATTCTCG ATGCTGCGAT TGAAAAAATT CCAGCAGAAT TTTACGAGCG 57 60 

CTACTCCATC AACAAATCTG CTCCTTTCTA CCTCGAACTC CTTAAAAAGA ATGTAGACAA 5820 

GGGTTCAGCC ATTACTCACT TGGCTGAAAA ACTCGGATTG ACCAAAGATG AAACCATGGC 5 880 

AATCGGTGAT GAAGAAAATG ACCGTGCCAT GCTGGAAGTC GTTGGAAACC CCGTTGTCAT 5940 

GGAAAATGGA AAT CC AG AAA TCAAAAAAAT CGCCAAATAC ATCACCAAAA CAAATGACGA 6000 

ATCCGGCGTT GCCCATGCCA TCCGAACATG GGTACTGTAA AAGTATCATT T T T C AAT AAG 6060 
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AATTG AT T AG CAATAAAATC CAATGAATTT TTTTAGCAAA CTATTTAATT TAAAACAAAA 612 0 

TAATCATAAT AGAGACACAA ATTCTGATTG TAACAATTTT TACCTAAACG AATTAGAATG 6180 

TGGCCTTACT CCTGGGCAAC T CAT AC T CAT AGATTGGACT CAAAAAACAG GGAGAAATTA 62 4 0 

TAATTTCCCA AGATATTTTA AATACTCTCT TCAAATTGAC CCTGAATCTA CACACAATCA 63 00 

ATTATACAAA TTAGGATACT TCACTAAAAA TAAGACTTTA TC AT AT CTT A CAGTAGTAGA 63 60 

ATTAAAAACT ATATTATCTA AACATAATTT AGCTACTTCT GGAAAAAAAG CAGAATTAAT 642 0 

TACAAGAATA ATTAATAATG TTAACATTGA CAATTTAGAT ATTCCGTTCG AATTTAAACT 6480 

AACAAAAGAA GCACAAAATC TTATTATCGA ACATAGTGAC TAT AT C AAAG CATACTATGA 6540 

TAAAGACATA ACTATGGAAG ATTATTGTAA AGAAAAAAAC AATATCTCTT TTAAAGCAAC 6 600 

TTTTGGTGAT ATAAAATGGA GTCTCTTAAA TAAACAAGCT CATAGGAATA CTGTATCAGG 6 6 60 

AGATTTTGGA TGCTTATCTA AC AC AC G AAA GGCTCAGGGA AGACATTTGG AACAAGAAGG 67 2 0 

TAATATTAAA CATGCTTTAA TATATTACAT AGAATCTTTG ATAATTACTA TTTCAGGATT 67 80 

AGAAAACAAT TTTTCAGCCA CTGATTATCC AGTATATTAT CCCGATTCGA TACCTGACTA 6840 

C T C ACT AAAA CATATTCAAA CATTAATGGA ATCATTATCT GATGACGATT ATGATTTTGC 6900 

TTTTGATGAA GCATTATTTC GCTTCTCAAT TTTGAATGCA AATCATTTTT TATCTAAGGA 6 9 60 

AGATATTGAC TATTTAAGAG TTAATTTACC TCGTTCCACT GCTGAAGAAA TAAACAATTA 7 02 0 

C TT AAAG AAA TATGAATGTT ATAGTCCTTT AAATAATTTA GAACTTGACG ATTTTGAATA 7 080 

AATTG ACT AT AC AAAC AT T T AT AT ACT C G A TATAGTCTCA ATTTTATCTG ATGATTGCCC 714 0 

AAATTTTTCA ATAATAAAAC GCATAATATT ATGGAGACAA TCCCCTATAT TATGCGTTCT 72 00 

TTTAATATCA AAGACTTTTT GACAAACTTC TTTGATATCT AATTACATGC CCCCTGCAGG 72 60 

AATCGAACCT GCAACTACTC CTTAGGAGGG AGTTGTTATA TCCATTGAAC TAAGGGAGCT 7 320 

AGATAAAAAC TCTGCTAAAT GAGCAGAGTT TTTTAGTCGA ATTAACGACG GATTTCTTTG 7 3 80 

ATACGAGCTG CTTTACCTTG AAGAGCACGC AAGTAGTACA ATTTCGCACG ACGTACTTTA 744 0 

CCGTAACGAA CAACTTCGAT TTTTTCAACA CGTGGAGTGT GGATTGGGAA GATACGCTCA 7500 

ACACCTACAC CGTTAGAGAT TTTACGAACT GTGTAGTTTT CTGAGATTCC AGCACCTTTA 7 5 60 

CGTGCGATAA CAACACG 7577 
(2) INFORMATION FOR SEQ ID NO: 47: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4945 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
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(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 
CCTCGCTGAT GATTGGTGCT GTTTTATTTG CTGGTCCAGC CTTGGCTGAA GAAACTGCAG 60 
TTCCTGAAAA TAGCGGAnCT AATACAGAGC TTGTTTCAGG AGAGAGTGAG CATTCGACCA 12 0 

ATGAAGCTGA TAAGCAGAAT GAAGGGGAAC AT G CTAG AG A AAACAAGCTA GAAAAGGCAG 180 
AAGGAGTAGC GATAGCATCT GAAACTGCTT CGCCAGCAAG CAATGAAGCT GCAACTACTG 240 
AAACTGCAGA AGCAGCTAGC GCAGCTAAAC CAGAGGAAAA AGCAAGTGAG GTGGTTGCAG 300 
AAACACCATC TGCAGAAGCA AAACCTAAGT CTGACAAGGA AACAGAAGCA AAGCCCGAAG 3 60 

CAACTAACCA AGGGGATGAG TCTAAACCAG CAGCAGAAGC TAATAAGACT GAAAAAGAAG 42 0 

TCCAGCCAGA TGTCCCTAAA AATACAGAAA AAACATTAAA ACCAAAGGAA ATCAAATTTA 4 80 

ATTCTTGGGA AGAATTGTTA AAATGGGAAC CAGGTGCTCG TGAAGATGAT GCTATTAACC 540 
GCGGATCTGT TGTCCTCGCT TCACGTCGGA CAGGTCATTT AGTCAATGAA AAAGCTAGCA 600 
AGGAAGCAAA AGTTCAAGCC TTATCAAACA CCAATTCTAA AGCAAAAGAC CATGCTTCTG 660 
T TGGTGG AG A AGAGTTCAAG GCCTATGCTT TTGACTATTG GCAATATCTA GATTCAATGG 72 0 

TCTTCTGGGA AGGTCTCGTA CCAACTCCTG ACGTT AT T G A TGCAGGTCAC CGTAACGGGG 7 80 

TTCCTGTATA CGGTACACTC TTCTTCAACT GGTCTAATAG TATTGCAGAT CAAGAAAGAT 84 0 

TTGCTGAAGC TTTGAAGCAA GACGCAGATG GTAGCTTCCC AATTGCCCGT AAATTGGTAG 9 00 

ACATGGCCAA GTATTATGGC TATGATGGCT ATTTCATCAA CCAAGAAACA ACTGGAGATT 960 

TGGTTAAACC TCTTGGAGAA AAGATGCGCC AGTTTATGCT CTATAGCAAG GAATATGCTG 102 0 

C T AAGGT AAA CCATCCAATC AAGTATTCTT GGTACGATGC CATGACCTAT AACTATGGAC 1080 

GTTATCATCA AGATGGTTTG GGAGAATACA ACTACCAATT CATGCAACCA GAAGGAGATA 114 0 

AGGTTCCGGC AGATAACTTC TTTGCTAACT TTAACTGGGA TAAGGCTAAA AATGATTACA 1200 

CTATTGCAAC TGCCAACTGG ATTGGTCGTA ATCCTTATGA TGTATTTGCA GGTTTGGAAT 12 6 0 

TGCAACAGGG TGGTTCCTAC AAGACAAAGG TTAAGTGGAA TGACATTTTA GACGAAAATG 1320 

GGAAATTGCG CCTTTCTCTT GGTTTATTTG CCCCAGATAC CATTACAAGT TTAGGAAAAA 13 80 

CTGGTGAAGA TT AT CAT AAA AATGAAGATA TCTTCTTTAC AGGTTATCAA GGAGACCCTA 1440 

CTGGCCAAAA ACCAGGTGAC AAAGATTGGT ATGGTATTGC TAACCTAGTT GCGGACCGTA 1500 

CGCCAGCGGT AGGTAATACT TTT ACT AC T T CTTTTAATAC AGGTCATGGT AAAAAATGGT 15 60 

TCGTAGATGG TAAGGTTTCT AAGGATT CTG AGTGGAATTA TCGTTCAGTA TCAGGTGTTC 162 0 
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TTCCAACATG GCGCTGGTGG CAGACTTCAA CAGGGGAAAA ACTTCGTGCA GAATATGATT 1680 

TTACAGATGC CTATAATGGC GGAAATTCCC TTAAATTCTC TGGTGATGTA GCCGGTAAGA 174 0 

C AG AT C AGG A TGTGAGACTT TATTCTACTA AGTTAGAAGT AACTGAGAAG ACCAAACTTC 1800 

GTGTTGCCCA C AAGGG AG G A AAAGGTTCTA AAGTT T AT AT GGCATTCTCT ACAACTCCAG 1860 

ACTACAAATT CGATGATGCA GATGCATGGA AAGAGCTAAC CCTTTCTGAC AACTGGACAA 192 0 

ATGAAGAATT TGATCTTAGC TCACTAGCGG GTAAAACCAT CTATGCAGTC AAACTATTTT 1980 

TCGAGCATGA AGGTGCTGTA AAAGATTATC AGTTTAACCT AGGACAATTA ACTATCTCGG 2040 

ACAATCACCA AGAGCCACAA TCGCCGACAA GCTTTTCTGT AGTGAAACAA TCTCTTAAAA 2100 

ATGCCCAAGA AGCGGAAGCA GTTGTGCAAT TTAAAGGCAA CAAGGATGCA GATTT CT AT G 2160 

AAGTTT AT G A AAAAGATGGA GACAGCTGGA AATTACTAAC TGGCTCATCT TCTACAACTA 2220 

TTTATCTACC AAAAGTTAGC CGCTCAGCAA GTGCTCAGGG TACAACTCAA GAACTGAAGG 22 80 

TTGTAGCAGT CGGTAAAAAT GGAGTTCGTT CAGAAGCTGC AACCACAACC TTTGATTGGG 2 34 0 

GTATGACTGT AAAAGATACC AGCCTACCAA AACCACTAGC TGAAAATATC GTTCCAGGTG 2 4 00 

CAACAGTTAT TGATAGTACT TTCCCTAAGA CTGAAGGTGG AGAAGGTATT GAAGGTATGT 24 6 0 

TGAACGGTAC CATTACTAGC TTGTCAGATA AATGGTCTTC AGCTCAGTTG AGTGGTAGTG 2 52 0 

TGGATATTCG TTTGACCAAG CCACGTACCG TTGTTAGATG GGTCATGGAT CATGCAGGAG 2 580 

CTGGTGGTGA GTCTGTTAAC GATGGCTTGA TGAACACTAA AGACTTTGAC CTTTATTATA 2 64 0 

AAGATGCAGA TGGTGAGTGG AAGCT AG CT A AGGAAGTCCG TGGTAACAAA GCACACGTGA 2 700 

CAGATATCAC TCTTGATAAA CCAATCACTG CTCAAGACTG GCGCTTGAAT GTTGTCACTT 2 7 60 

CTGACAATGG AACTCCATGG AAGGCTATTC GTATCTATAA CTGGAAAATG TATGAAAAGC 282 0 

TTGATACTGA GAGTGTCAAT ATTCCGATGG CCAAGGCTGC AGCCCGTTCT CTAGGCAATA 2 880 

ACAAGGTACA AGTTGGCTTT G C AG ATGT AC CGGCTGGAGC AACTATTACC GTTTATGATA 2 94 0 

ATCCAAATTC TCAAACTCCG CTCGCAACCT TGAAGAGCGA AGTTGGAGGA GACCTAGCAA 3 000 

GTGCACCATT GGATTTGACA AATCAATCTG GTCTTCTTTA TTATCGTACC CAGTTGCCAG 30 60 

GCAAGGAAAT TAGTAATGTC CTAGCAGTTT CCGTTCCAAA AGATGACAGA AGAATCAAGT 3120 

CAGTCAGCCT AGAAACAGGA CCTAAGAAAA CAAGCTACGC CGAAGGGGAG GATTTGGACC 3180 

TTAGAGGTGG TGTTCTTCGA GTTCAGTATG AAGGAGGAAC TGAGGACGAA CTCATTCGCC 3240 

TAACTCACGC AGGTGTATCA GTATCAGGTT TTGATACGCA TCATAAGGGA GAACAGAATC 3 3 00 

TTACTCTCCA ATATTTGGGA CAACCGGTAA ATGCTAATTT GTCAGTGACT GTCACTGGCC 3 3 60 
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AAGACGAAGC AAGTCCGAAA ACTATTTTGG GAATTGAAGT AAGTCAGGAA CCGAAAAAAG 342 0 

ATTACCTAGT TGGTGATAGC TTAGACTTGT CTGAAGGACG CTTTGCAGTG GCTTATAGCA 3480 

ATGACACCAT GGAAGAACAT TCCTTTACTG ATGAGGGAGT TGAAATTTCT GGTTACGATG 3 54 0 

CTCAAAAGAC TGGTCGTCAA ACCTTGACGC TTCATTACCA AGGCCATGAA GTTAGCTTTG 3 6 00 

ATGTTTTGGT ATCTCCAAAA GCAGCATTGA ACGATGAGTA CCTCAAACAA AAATTAGCAG 3 660 

AAGTTGAAGC TGCTAAGAAC AAGGTGGTCT ATAACTTTGC TTCATCAGAA GTAAAAGAAG 372 0 

CCTTCTTGAA AGCAATTGAA GCGGCCGAAC AAGTGTTGAA AG AC C ATGAA ACTAGCACCC 37 80 

AAGATCAAGT CAATGACCGA CTTAATAAAT TGACAGAAGC TCATAAAGCT CTGAATGGTC 3840 

AAGAGAAATT TACGGAAGAA AAG AC AG AG C TTGATCGCTT AACAGGTGAG GTTCAAGAAC 3 900 

TCTTGGCTGC CAAACCAAAC CATCCTTCAG GTTCTGCCCT AGCTCCGCTT CTTGAGAAAA 3 960 

ACAAGGCCTT GGTTGAAAAA GTAGATTTGA GTCCAGAAGA GCTTACAACA GCGAAACAGA 402 0 

GTCTAAAAGA TCTGGTTGCT TTATTGAAAG AAGACAAGCC AGCAGTCTTT TCTGATAGTA 4 080 

AAACAGGTGT TGAAGTACAC TTCTCAAATA AAGAGAAGAC TGT CAT C AAG GGTTTGAAAG 4140 

TAGAGCGTGT TCAAGCAAGT GC TG AAG AG A AGAAATACTT TGCTGGAGAA GATGCTCATG 42 00 

TCTTTGAAAT AGAAGGTTTG GATGAAAAAG GTCAAGATGT TGATCTCTCT TATGCTTCTA 42 60 

TTGTGAAAAT CCCAATTGAA AAAGATAAGA AAGTTAAGAA AGTATTTTTC TTACCTGAAG 4320 

GCAAAGAGGC AGTAGAATTG GCTTTTGAAC AAACGGATAG TCATGTTATC TTTACAGCAC 43 80 

CTCACTTTAC TCATTATGCC TTTGTTTATG AATCTGCTGA AAAACCACAA CCTGCTAAAC 4440 

CAGCACCACA AAACACAGTC CTTCCAAAAC CTACTTATCA AC CG ACT T C T GATCAACAAA 4500 

AGGCTCCTAA ATTGGAAGTT CAAGAGGAAA AGGTTGCCTT TCATCGTCAA GAGCATGAAA 4 5 60 

ATACTGAGAT GCTAGTTGGG GAACAACGAG TC AT CAT AC A GGGACGAGAT GGACTGTTAA 4 620 

GACATGTCTT TGAAGTTGAT GAAAACGGTC AGCGTCGTCT TCGTTCAACA GAAGTCATCC 4680 

AAGAAGCGAT TCCAGAAATT GTTGAAATTG GAACAAAAGT AAAAACAGTA CCAGCAGTAG 474 0 

TAGCTACACA GGAAAAAC C A GCTCAAAATA CAGCAGTTAA ATCAGAAGAA GCAAGCAAAC 4800 

AATTGCCAAA TACAGGAACA GCTGATGCTA ATGAAGCCCT AATAGCAGGC TTAGCCAGCC 4860 

TTGGTCTTGC TAGTTTAGCC TTGACCTTGA GACGGAAAAG AGAAGATAAA GATTAAATAT 4920 

CGAAAAATCT TGTGAAATCT TTCCG 4945 
(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25002 base pairs 

(B) TYPE: nucleic acid 
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<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 48: 

GACAACTCAA GTAGCTTTTT CTTATTTTGA AAAAGGAGAT CAGAGTTTAA CTATGTCAGA 60 

AAAATCACAA TGGGGGTCGA AACTTGGTTT TATTCTAGCA TCTGCTGGCT GGCCATCGGG 12 0 

CTTGGTTCCG TTTGGAAGTT TCCCTACATG ACTGCTGCTA ATGGCGGTGG AGGCTTTTTA 180 

CTAATCTTTC T C ATTTCC AC TATTTTAATC GGTTTCCCTC TCCTGCTGGC TGAGTTTGCC 240 

CTTGGCCGTA GTGCTGGCGT TTCCGCTATC AAAACCTTTG GAAAACTGGG CAAGAATAAC 3 00 

AAGTACAACT TTATCGGTTG GATTGGCGCC TTTGCCCTCT TTATCCTCTT ATCTTTTTAC 3 60 

AGTGTTATCG GAGGATGGAT TCTAGTCTAT CTAGGTATTG AGTTTGGGAA ATTGTTCCAA 42 0 

CTTGGTGGAA CGGGTGATTA TGCTCAGTTA TTTACTTCAA TCATTTCAAA TCCAGCCATT 480 

GCCCTAGGAG CTCAAGCGGC CTTTATCCTA TTGAATATCT TCATTGTATC ACGTGGGGTT 540 

CAAAAAGGGA TTGAAAGAGC T TCGAAAGT C ATGATGCCCC TGCTCTTTAT CGTCTTTGTT 600 

TTTATCATCG GTCGCTCTCT CAGTTTGCCA AATGCCATGG AAGGGGTTCT TTACTTCCTC 6 60 

AAACCAGACT TTTCAAAACT GACTAGCACT GGTCTCCTCT ATGCTCTGGG ACAATCTTTC 72 0 

TTTGCCCTCT CACTAGGGGT T AC AGT CAT G TTGACCTATG CTTCTTACTT AGACAAGAAA 780 

ACCAATCTAG TCCAGTCAGG AATCTCCATC GTAGCCATGA ATATCTCGAT ATCCATCATG 84 0 

GCAGGTCTAG CCATTTTCCA AGCTCGATCC CCCTTCAATA TCCAGTCTGA AGGGGGACCC 9 00 

AGCCTGCTCT TTATCGTCTT GCCTCAACTC TTTGACAAGA TGCCTTTTGG AACCATTTTC 9 60 

TACGTCCTCT TCCTCTTGCT CTTCCTTTTT GCGACAGTCA CTTTTTCTGT CGTGATGCTG 102 0 

GAAATCAATG TAGACAATAT CACCAACCAG GATAACAGCA AACGTGCCAA ATGGAGTGTT 1080 

ATTTTAGGAA TTTTGACCTT TGTCTTTGGC ATTCCTTCAG CCCTATCTTA CGGTGTCATG 114 0 

GCGGATGTTC ACATTTTTGG TAAGACCTTC TTTGACGCTA TGGACTTCTT GGTTTCCAAT 120 0 

CTCCTCATGC CATTTGGAGC TCTCTACCTT TCACTTTTTA CAGGCTATAT CTTTAAAAAG 12 60 

GCTCTTGCAA TGGAGGAACT CCATCTCGAT GAAAG AG CAT GGAAACAAGG ACTGTTCCAA 132 0 

GTCTGGCTCT TCCTTCTTCG TTTCTTCGTT TCGTCATTCC AAT CAT CATC ATTGTGGTCT 1380 

TCATTGCCCA ATTTATGTAA TCAAAAAGGA CTTGAGTAGT GAACTCAGGC CCTTTCTTTT 1440 

TATGGATGGC TAACAATCAA TTCCAAACCT TGCCCTTCCA GAGTCCAAGC TTCAACATCA 1500 

CTTGGTAGGA TAAAGTGGCT GCCTTTTTGA ATTGGATAAT TTTTCCCGTC AACAGTTAGC 1560 
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TGACCTTGAC CAGCCAAGAC ACTCAATAAG CTGTAGTCAG CTGTCTTTTC AAAGTCAACT 162 0 

TTTCCAGTAA TTTCCCACTT GTAAACTGCG AAGAAATCAT TAGATACAAG GAGAGTGGAA 1680 

CGCAAATCAT CTGCTTTAAC AGTTACAGGA CGGCTATTTG CTGGCTCACC AATGTTCAAG 1740 

ACATCGATGG ATTTTTCAAG ATGAAGTTCA CGCAAGTTGC CTTTGTCATC CTTGCGGTCA 1800 

AAGTCATAGA CGCGATAGGT GGTATCGCTA GACTGCTGGG TTTCAAGGAT TAAGATACCC 18 60 

GCCCCGATAG CGTGCATAGT CCCGCTTGGT ACATAGAAGA AATCTCCAGC CTTAACAGGG 192 0 

ACTTTGGTCA ACAAGTCATC CCAGTTCTTG TCCTCGATTT GCTGGCGGAG TTCTTCTTTT 1980 

GACTTGGCAT TGTGACCGTA GATAATCTCT GAACCTTCAT CCGCTGCGAT AATGTACCAG 2040 

CATTCTGTTT TTCCGAGTTC GCCTTCATGC TCGAGTCCAT AAGCATCGTC TGGGTGAACT 2100 

TGGACACTGA GCCAGTCGTT GGCATCGAGG ATCTTGGTCA AAAGTGGAAA TACAGGTTCT 216 0 

GGACGATTGC CAAATAATTC ACGGTGTTCC GCATACAAAG TAGCAAGATC TGTTCCCTCG 2220 

TAACGACCAT TGGCAACTTT AGAGACTCCA TTTGGATGGG CTGAGATGGC CCAATATTCT 22 80 

CCGATTTTTT C AC T TGGG AT GTCGTAGCCA AACTCATCAC GTAGCTTGGC TCCACCCCAG 2340 

ATTTTTTCTT GCATAACTGA TTGTAAAAAT AATGGTTCTG AC ATGT CG AT CTCCTGTCTG 240 0 

ATTTTTCTCC CCTCATTATA GCAAAAAAAG AGTTCGAATT GAACTCTTTT TTACATCTTA 24 60 

TAAAGCAGGG AGAAGATTTT ATAAAAATAG TAAACAAATG TGCTCTACCC GATGCTTGCA 252 0 

CCATTGCTAT AAATGACATC CTTGTACCAA TAGAAGGACT TCTTCTTGCT ACGTTTGAGA 2580 

GCTCCGTTTC CTACATTATC TCGATCTACA T AG AT AAAG C CATAGCGCTT ATTCATTTCC 2 640 

CCTGTGCCAG CTGAAACCGG AT CGAT AC AG CCCCAAGTCG TATAACCAAG CAAGTCAACC 27 00 

CCGTCTTGGT AAAT GG CATC TCGCATGGCC TTGATGTGGG CCTCTAAGTA AGTAATCCGA 27 60 

TAGTCATCTG CTACATAACC ATTCTCATCC GGTGTATCCA TAGCACCGAG TCCATTTTCT 2 82 0 

ACGATAATAC TAAACTAAAA TCAAAAAGCA TTATATAATA GTGATATGAA AT CAACT AAA 2 880 

GAAGAAATCC AAACCATCAA AACACTTTTA AAAGACTCTC GTACAGCTAA ATATCATAAA 2 940 

CGCCTTCAAA TCGTTCTATA GTAAAATGAA ATAAGAACAG T AC AAATC G A TCAGGACAGT 3000 

CAAATCGATT TCTAACAATG TTTTAGAAGT AGGGGTGTAC TATTCTAGTT TCAATCTACT 3 060 

ATATTTCGTC TGATGGGCAA ATCTTATAAA GAGATTATAG AACTTTTATA GTAGTTTGAA 312 0 

ATAAGATGTG AACAACTCTA TCAGGAAAGT CAAATTAATT TATAGAAATA TTTTAGCAGC 3180 

CAAGGTGTAC TGTTATAGAT TCAATACACT ATAGACTGTA ATCAAACAAC GATTTGGCGA 3240 

AATGTAAAAA AATATGAGGA GTTCGGACTC GACTCTCTCC TTCAAGAAAC ACGTGGTGGT 3300 

CGTAACCATG CATATATGAC AGTTGAGGAA GAGAAAGCCT TTCTTGCCCG CCATTTGAAG 33 60 
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GCTACAGAGG CAGGAGAATT TGTTACAATT GATGCCTTAT TTCAGGCTTA TAAAAAGGAG 342 0 

TTAGGTCGTT CCTACACACG TGATGCCTTC TATCAACTGT TGAAGCGCCA TGGTTGGCGA 3480 

AATATTACGC CACGTCCAGA ACATCCTAAG AAAGCAGACG CTCAAACCAT TGTTGCGTCT 3 540 

AAAAATAAAA TCTCAATCCA AGAAGGCAAG AAAGCGTTTT AAATATAGTA GACGTTTTCG 3 600 

TAAGGTTTGC TTGATGTACC AAGCTGAAGC TGGTTTCGGT AGAATCAGTA AACTGGGATC 3 660 

TTGTTGGGCT CCAATAGGAG TAGGTCCACA TATCCATAGT C AC TAT AT AC GAGAATTTCG 3720 

CTATTGTTAT GGAGCTGTTG ATGCCTATAC AGGCGAATCA TTTTTCTTAA TAGCTGGTAG 3780 

ATGTAATACT GAGTGGATGA ACGCCTTTTT AGAAGAGCTT TCACAAGCTT ATCCTTTTAC 3840 

TCGTTATGGA CAATGCTATA TGG CATAAAT CAAGTACCTT AAAGATTCCG ACTAATATTG 3900 

GTTTTGCATT TATTCCTCCA TACACACCAG AGATGAACCC CATTGAACAA GTGTGGAAAG 3 9 60 

AGATTCGTAA ACGTGGATTT AAGAATAAAG CCTTTCGAAT TTTGGAAGAT GTCATGAATC 402 0 

AACTCCAAGA TGTCATACAA GGATTGGAGA AGGAGGTGAT AAAGTCCATC GTTAATCGGA 40 80 

GATGGACTAG AATGCTTTTT GAAAGCAGAT GAGTATTATA TGCAATTTCT T T AT AT AAAA 4140 

AG ACCGG AT T GCTCCGATCT TTCAATAGTT CATATTCTCA ATTTCTATTT T AAAAAT AG C 42 00 

TAAGGTTAAC GTCAAATGAC TACGCGACCT AT T T CAT AC G AT AAAAAT C A AGCACTAGAC 4260 

CAGCAGGTCC TTGAACTAAT AAGGACTCTG TTCCCCAATC GGTTACAGTT GGTCCGTGTA 43 2 0 

AAACCTTTAT ACCAAGCTCG TTCAACCGTT TGTAGTTCTG GTCTACATCC TCAACCTCGA 43 80 

TATGAATAAT GATTCCTGAC TGAAAGTTTT CCAAAGGAAC CAAATGATTT TGTGACAACA 4440 

TAAGGCAGTG ACTACCAATC GTAAACTGAG C AAAAC CAT C ATTAGCATAA TCTGCCTTTT 4 500 

TATCCAAGAT ATGCTCCAAG T C AG C AC AG A CTTGGGGAAC ATTTGAAACG ATAATATCTA 4 560 

ATTGATTTAA ATTCATTTAC TCTCCTCCAT AAAAAGACCG GATTGCTCCG ATCTTTTAAA 462 0 

GTTCTGCTCT ATGAAAATCA AAGAATAAAG TCTACAAGTT TCATATTTGA TTTTCGGCGA 4680 

GAGGAATTAT TTAATTGCGC GTGATTGCAA TCCTTCTTCT TCCAAGAAGA GACGGAATGG 4740 

TACGAGTTCT TCTGCTTCGT ATTTTTCCTT GAAGGCTTTG ATAGCTTCTT CTGAGTGAAG 4 800 

TTTTGGATCC AATTCAAGTA CTTCTACTGG AAGTGGACGG TGTTGAGTGA TGCGAGCATC 48 60 

GATGACAACA GTTTTACCTT CTTTGTTCAA TTTAACAGCT TCTGCAACAA CTGCATCGAT 4 92 0 

GTCTTCGATA CGGTCAACTG TGAATCCAAC AGCTCCTTGA GCTTCCGCAA TTTTAGCGTA 4980 

GTCAGCGTTT GTGAAGTCTA CACCAAACAA GTGTTTGTTT GTATCTTCGT ATTTGTTCTT 5040 

GATGAAGCCG TACTCAGCAT TTGAGAAGAC AAGGTTGATA ACTGGAAGGT CGTATTGAAC 5100 
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GTTTGTGATA ACGTCTGGGT AGCACATGTT GAATGCTCCG TCACCCATGA TGTTCCATAC 5160 

TTGGCGATCT GGATTGTCTT TCTTAGCAGC GATACCACCA GGAAGGGCAA TACCCATTGT 52 2 0 

CGCAAAGAGT GGAGATGTAC GCCACATGTT CTTAGGTGTC ATGTGAAGGT GACGAGTAGA 52 80 

TGTTTGAGTA GTGTTACCTA CGTCGATTGA GTAGATAGCG TCTTGATCAG CATGTTTGTT 53 40 

GATTGCATTG TAAACTTGAT ACAATTGCAA TTCACCCTCA GTTTTACCTT CGAGTTTGTT 5400 

CATGTAATCA CGCCAGTTTT GGTTGTTCTT AACGTTTGCA CGCCACCATG GAGTTGATTC 54 6 0 

AACTGGGTTT ACTTTGTCAA GGATAGCTTT AGCTGCTTGA CCAGCATCAC CAAGGATTGA 552 0 

AGCGTCAAGG GCATGACGTT TACCAAGTTT GTAAGGGTCG ATATCGACTT GGATGAATTT 5580 

TTCAGTGTTC TTGAATGCTT CGTAAACTTC AGCAAATGGG AAGTTTGAAC CAAGGAAAAG 5640 

AACTGTGTCT GCTTCAAAGA CCACTTCGTT GGCTGGTTTC CAACCAACAC GGTAAGCAGA 57 00 

ACCTGTCAAA CCTTCATAGT TCCATTCGAA AGCTTCAAAG TTTTTACCAG TTGTGATGAT 57 60 

TGGTGCTTTG ATTTTACGTG ACAATTCAGT AATCACTTCA CCAGCTTTAA CACCACCAAA 5820 

TCCAGCATAG AT AAC TGGGC GTTCAGCATT GTTCAAGATT TCAACAGCTT TGTCGATTTC 58 80 

AACTTCGTTC AAAGCAGGAG CGATGAATGA GCGTTCGTAT GAACCTGAAC CGTAGTATGA 5940 

GTTTTCATCG ATTTCTTGGA AACCGAAGTT TACTGGAATT T C AAC AAC AG CTGGACCTTT 6000 

TTTAGAAACT GCAGCACGGC AGGCTTCGTC AATTACTTTT GGCAATTGCT CAGCGTAAGC 6060 

TACACGTTTG TTGTAAACAG CGATACCGTT GTACATTGGG TTTTGGTTAA GCTCTTGGAA 612 0 

AG CAT C C ATG TTCAATTCGT TAACTGGACG TGATCCAAGG ATCGCTAGGA ATGGAGTGTT 6180 

ATCCATAGCT GCATCGTAAA CACCGTTAAT CAAGTGAGTC GCACCTGGAC CACCTGAACC 6240 

AACTGCAACC CCGATTGAGC CGCCGAATTT AGCTTGCATA ACCGCTGCAA GAGCACCTGT 6300 

CTCTTCGTGG CGAACTTGTA AGAAACGGAT ATCTTTGTCT TCAGCCAAAG CGTCCATCAA 63 60 

TGAGCTGAGT GTTCCTGATG GGATACCGTA GATTGTATCT ACGCCCCATG TTTTCAATAC 6420 

GTTAAGCATT GCTGCAGATG CAGTAATTTT CCCTTGAGTC ATAATGATAA CTCTCCTTCA 64 80 

ATTTTTTTAA AC TTGG AG AA TACGATTACA TAGAATTGGA AACGTTCTCC AAATTTTTAC 6540 

TATTCCACTG TATCATATTT ATGCTGACTT TTCTAAAAAT CTGCTCAAAA CTCTCTATTC 6600 

TCTATTCTAA TACAGTTTTG AAAGTTCTGT CATTTCTGTT TTATAACAAA GAAATCTAGT 6660 

CATTACTTTT AGTCTATTTT ACTAAAATTT AACAGAAGGG AACTGGTCAG AACAGATACA 6720 

GAACTAAAGG CCATGGCTAG ACCTGCCAAT TCTGGGTTGA GAGCCAGTCC AACACCTGAA 6780 

AAGACTCCTG CTGCAATCGG AATTCCGACA ACATTGTAGA TAAAAGCCCA GAAAAGATTG 684 0 

AGTAGAATTC GATGAAAGGT TTTCTTACTC ATATCAAAGG CACGAACCAC TCCTAAAAGA 6900 



WO 98/18931 



PCT/US97/19588 



435 

TTATTGGTTG TCAACACCAA ATCTGCTGAC TCGATGGCGA TATCTGTTCC AG CT C C CAT A 6 960 

GCAATCCCCA CATCTGCTAC ACTAAGGGCA GGAGCGTCAT TGATACCGTC CCCAACAAAG 7 02 0 

GCTACTTTCC CTGACTGTTG CAGTTTATGG ATTTCATGGG CTTTTTCTTC TGGCAAGACG 7 080 

CCTGCAATGA CCTCTTCAAT TCCGATTTGA TCTGCAATAG CACGCGCCAC ACCAGCATTG 714 0 

TCTCCTGTCA GCATGACTGT TCGGAGACCA CGTTTTTTTA GCTGACTGAT GGCTAGCTTA 72 00 

GCATTTTCCT TAGGAATATC TTGCAAAGCA AGCAAGCCTT TGATTTCATT GTCAACAGCT 7 2 60 

AAGAACACAA CTGTCTTAGC TTCTTTTTCT AGTTCTTCTA GTTTATCTTG ATAAGTATTA 7320 

GAAATATCCA TGCCATCCAG CATTTTAGCA TTTCCAAGTA AAACTTGTTT T C C AT TG AT T 73 80 

CGCCCTGAAA CACCTTTCCC GTGCAAGGAC TGAAAATTTT CAACAGTTTG AAACTCAAGT 744 0 

CCAGCTTCAC TCGCTCGCTT AACGAT AG C C TCAGCCAGTG GG T G T TG AG A AGCATCTTCC 7 500 

AAGGAGGCTG CCAACCCAAA CACTTCTACT TCGTCGCCGA TGACATCTGT TACCACAGGT 7 5 60 

TTCCCTTCCG TCAAAGTCCC GGTCTTATCA AAGACAAGGG TTTGAACTTT CTGGATTTCC 7 62 0 

TGTAAGACAG TTCCATTTTT GAGGAGAACC CCCATCTTGG CACTACGTCC TGTCCCCACC 7680 

ATAAGGGCTG TCGGTGTTGC AAGTCCCAAG GCACAAGGAC AGGCGATAAT CAAAACCGCC 7 740 

ACT CCGT AG A GAAGAGAGGA CACAAAGCTA GCTCCAAGCA CAACCACACT ATCCCTGAGC 7 800 

AAGACGAACC AAACCCAAAA GGTCATGATT CCTAAAATGA CAACTACT GG G AC AAAAAT C 7 8 60 

CCTGAAATCT TATCCGTCAA GTCCTGAATC GGCGCACGAC TTGTCTGAGC TTTCTTCACA 7 92 0 

AAATCCACAA TCTGAGCCAA AACAGTCTCT GAGCCAACTT TTTCTGCTCT AAAGACAAGC 7980 

GTTCCACTAT GATTGATGGT TGAGCCAATG ACAGTATCTC CAACTGTCTT GTCCACAGGC 804 0 

AGACTCTCAC CTGTCACCAT GGATTCGTCA ATACTAGAGA CACCTTCTAC TACGACACCA 8100 

TCAACAGCAA TCTTTTCACC GGGACGCACT CGAATCAGGT CGCCTACCTT GACTTGTTCC 8160 

AAAGGAACTT GG AC AT AAC T ATCATCACTC AAGACTTCTG CGGTTTTAGC TTGCAAGTCC 82 2 0 

AGTAATTTCT CCACAGCTTG GGACGTATTT TTTCTCATTT TTTCCTCAAA AACTGCTCCC 82 8 0 

AAAAGAACGA AAAAGAGGAT AAATCCAGCA CTTTCGAAGT AAACAGGGAG ACCAGCAAAG 83 4 0 

AGAGCAACTA GGCTATAGAA ATAAGCCACT AGAGTTCCCA GCGCAACCAA G GT AT C CAT G 8400 

TTGGCATTGT GCTTTTTAAA ACTGGCCCAA GCACTCTGGA TATATGGCTT ACCTGCAACT 8460 

AACATAATAG GCGTTGTTGC TAGAAAGGTT CCCCAATGCA TGACTTGATG ACTAATGCTA 852 0 

CCTGTCAACA TCCCAATCAT GAGAATCACA AGAGGCACAG TAAAGATACT AGTAATCCAA 8580 

AAACGTTGCA GGAGAGATAG AGATTTTCGA GTCTTCTCAA CGACTGTATA GCTTCCCTTT 8 64 0 
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TGCATCTTCA TGCCACAAGA AAATTCATGT CGCCCTAATT CTTGAGGCGT AAAACGAATG 87 00 

ACTTTCTCCT CATCTACGCC GATTGGTTCC AAGATACCTT CTTCTTCAAA CAGAATTTCC 87 60 

TTATAACAGT TTGAAGGAGT AGCACGATGA AAGGTAATCT CAGCTGGAAT TCCCTTTTGA 8820 

AG C TGG AT AT GGGCTGGATG ATAGCCTTTT TCAGCTCGGA TACGGATTTT TTGAATGCCA 88 8 0 

TTTTCTAAGC TTGCTTTCAC AATTTCTGTC ATAGTCTCCA CCTACTCTAC AATCATCTTG 8940 

CCGTGCATCA TGTTCATACC ACAAGCAAAG CCAAACTCTC CAGCCTGTTC AGGCGTGATT 9000 

TCCACTACAT ACTCTTCCCC CATTGGCAGG TTCGCATGTA CACCAAAATC TGGAAAAACA 9 0 60 

ATTTGATCCA GACATGGTGA AGGATCCTTG CGGTCAAAGA CAATGCGTGC TGGCACTGAT 9120 

TTCTTGAGGA CAATCAACTC AGGAGTATAG CCTCCCATGA CTTCCACTCG AATCTCTTGG 9180 

TATCCGTTTT TTTGCTGGGC TTTTTGTCCA GATTTTTCAG GCTTTTTGAA AAACCAAAAC 92 40 

AAGATAAACG CGATAAGGGC AATACAAATA ATGGTTACAA TACTATTTAA CATGACGTCT 9 3 00 

CCTTTACATA CAATTACATC TTACTTCTGT TACAGCACTT GATTTCTTCT CTGAAATCAC 93 60 

AGCTTCCAAG TCTTCCAAGT CAGTCTGAGT AAATTCACAT TCTACAATCA AGTCAGCCAA 94 2 0 

CAAATTCCTA ATCCTACGGG AACAAACCTT GTCTTTGATA TCTTGGACAA GTAAATCCCG 9480 

ACTTTGGTCT AGAGTTAAAA GGGCTGAATA AACAAAGGAC TTGCCTTCTT TTTTCCGAGT 9540 

CAAACACTCT TTATCAACCA GACGAGCCAA AAGTGTCTGA ACCGTGGACT TGGACCAGTC 9 6 00 

AAACCGCTCT GCCAAAACCC TAATCAAATC TGTACTGGTC TGCTCCCCCT GCATCCAAAT 96 60 

AATCTTCATG ACCTGCCATT CTGCATCTGA AATCTGCATT ACCATACCTC CAAAATCTAC 972 0 

ATTTGTCAAT TACACTCATC AGTATACTCT TAAAATCTAC ATTTGTCAAT TATAGAAATA 97 80 

ATATTTTCTT CGAAAAATAG AATTTTAATC ATTTGAAAAA CGATTTGCAG TCAAATATTA 9 84 0 

CTATATAAAC AATAAAAATA TG CT AT AC T A AAGAAAAAAG AAAACAACCA CTAGGGGTGC 9 9 00 

GTAAAGCTGA GATTAACGAC TGTTAGATCC CTCTGACTCA ATCTAGGTAA TGCTAGCTGA 9 9 60 

TGGAAGTGGA AATGATAATG GGGACTAGCA GTCTTCTATT GCCTTTCTAA AACAGACTAG 1002 0 

CTTGTTCTTA AGAATACAAA CTTCAGTTGG TTGGGAGGTT T TAG AT G ACT TATTTACCCG 10080 

TTGCTTTGAC CATTGCAGGG ACTGACCCTA GTGGTGGTGC TGGCATTATG GCAGATTTAA 1014 0 

AGTCATTCCA AGCGAGAGAT GTCTATGGAA TGGCTGTTGT AACCAGTCTT GTCGCTCAAA 10200 

ATACCAGAGG TGTTCAGCTA ATCGAGCACG TTTCTCCTCA AATGTTGAAA GCCCAATTGG 10260 

AGAGTGTCTT TTCTGATATT CCACCTCAGG CTGTAAAAAC TGGAATGTTG GCTACTACTG 1032 0 

AAATCATGGA AATCATCCAA CCCTATCTTA AAAAACTGGA TTGTCCCTAT GTCCTTGATC 10380 

CTGTTATGGT TGCTACAAGT GGAGATGCCT T GAT T G ACT C AAAT GCT AG A GACTATCTCA 10440 
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AAACAAACTT ACTACCTCTA GCAACTATTA TTACGCCAAA TCTTCCTGAA GCAGAAGAGA 10500 

TTGTTGGTTT TTCAATCCAT GACCCCGAAG ACATGCAGCG TGCTGGTCGC C TG ATT T T AA 105 60 

AAGAATTTGG TCCTCAGTCT GTGGTTATCA AAGGCGGACA TCTCAAAGGT GGTGCTAAAG 1062 0 

ATTTCCTCTT TACCAAGAAT GAACAATTTG TCTGGGAAAG CCCACGAATT CAAACCTGTC 10680 

AC AC C C ATGG TACTGGATGT ACCTTTGCTG CAGTGATTAC TGCTGAACTA GCCAAGGGCA 10740 

AGAGTCTTTA CCAGGCAGTT GATAAGGCCA AGGCCTTTAT CACAAAAGCT ATTCAAGATG 10 800 

CCCCTCAACT CGGTCATGGT TCTGGTCCAG TCAACCATAC AACTTTTAAA GATTAAGAAA 10 8 60 

AAAAACTCTC TAGTTCCCAC TTTAAGGGAA T TAG AG AG T T TTTATACTCT TCGAAAATCT 1092 0 

CTTCAAACTA CGTCAGCTTC CATCTGCAGC CTCAAAACAC TGTTTTGAGC TGACTTCGTC 10980 

AGTCTTATCT AAAACCTCAA GGCAGTACTT TGAGCAACCT GCGACTAGCT TTCTAGTTTA 11040 

CTCTTTGATT TTCATTGAGT ATTAATTAGG AAAGAATGTT ATGCAACTTT TTTAAAAAGG 11100 

CTTGCGTTTT TGCCTCAATA TCTTCTGCTT GCATCAAATC ACGTACAACA GCTACACCAG 11160 

CTATGCCAGT GCCCATAAGC TGATCAATAT TCTCCGAAGT CAAGCCTCCA ATAGCAACTA 112 2 0 

CTGGAATGGC AACCGTTTGG CAAATTGTTT TCAAGGTCGA TATCAGAGTA ATGGGCGCAT 112 80 

TTTCCTTGGT GGTGGTTGGG AAAATGGCTC CTGTACCCAA GTAATCTGCA CCTGATTTCT 11340 

CCGCTTCCAG AGCTCTTTTA ACCGTTTTAG CGGTGACACC GAGGATTTTT TCAGGACCCA 11400 

AGACTTTGCG AGCTACCGAA ACTGGTAATT CATCATCTCC G AT ATG C AG A CCTGCTGCAT 114 60 

CAACCGCAAG AC AAAC AT C C AACCGATCAT CGATTATCAA GGGTACCTGA TAAGCATCTG 1152 0 

TTATTTCCTT GACTTGTTTT GCCAGTTGAT AATATTGATT GGTTGTGAGA TTTTTTTCTC 11580 

GCAATTGGAC TATGGTAACC CCTGAACGGC AGGCCGTCTC AACTTTTGCA AGAAAGCTTT 11640 

CCACGGAATC TTGATAGCGA TTGGTTACCA GATATAGTCT AAGTGCTTCT CTATTCATAA 117 00 

ACCTCTCCTT TGATGGTATC TAGCCAATTT TCATCTCTTC TTAGGAGCGA AAGCTGATTG 117 60 

AGTACTTGGT AACGAAATTC TTCCAATCCC ATTCCTTGAA CAACTATTTT CTCAGCAGCG 11820 

ATATTGAGAT AAGAGACTGC TAAGCAAGAA GCTTCAAAAC CAGTCTTTCC TTGGCTGAGA 11880 

AAAACAGCTG TTAAGGCTCC AACCAAGTCT CCTGTCCCTG TTATCCAGTC TAATTCAGTA 11940 

CAGCCATTTC CCAGTACAGC GACCTGATTT TTCGAAACGA CGAGGTCCTT GGGACCTGTG 12 000 

ACTAAGAAAG ACATACCAGG AT AGGT CTG A CACCAGTCTT TCAAGACTTG AAGCAAATCC 12 060 

TCCGTTTCTT GATCTTTAGC ACTCGCATCG ACCCCAACGC CGTGGTGCTT TAATCCAACA 1212 0 

AGACTTCGAA TTTCTGACAT GTTTCCTTTA AGGACCGTAG GTCTATAGTC TAAAAGGTCT 12180 
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TTAACTAAGC 
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CTGAATGAGA GGAAAAGAAT TGAATACCAC AATCAAGGCA TAGGACCAAG AGCGTGATAG 14 040 

CCCCTTTTGA GCCAAGTACA AGAGAAGCTC TTTTAGTGAA ACAGAGGAAA CAAAGACAAG 14100 

GCCGATACAA ACTGTCACAA AGGCCCTCGT TCCAAGCATG ACTGCCTGTG AAGCATCTCC 14160 

GTGTAACTGA ACTGCCCAGT AGTTGGCAAA AGATGGTAAA ATGGCAAGTA TGATCATCCA 142 2 0 

AGCTAACATT TTAAATCGAC GGTAATAGAG CATAAAGAGA ATACAAAATG CGACTACCGA 142 80 

AAGAGTCAGA GCAATCGAAG GAATGAAAGA TGTTTCCAAG G AT AAAAT C A GCAAGAAGAG 14 340 

ACT G ATAAT C GGTGTCTGGG TTGCTACTTT G AC CAT AC T A TCTCACCTCC CCTTGGGTAT 144 00 

TGCTACTCTG AGATGTAAGT GGTTTGGTAA TGGTCACTTC TTTCACATGC CGAAGACCCT 144 60 

GACTAGTCAT CTCAATCCAA TAATCAACCA CAGAAATCAA AGGGTCTAAA CGATGACTAA 14520 

TGAGCAGAAA ACTTCTTCCT TGATTCCTCT CCTCCACAAT CCACTTGCAA AAATAATGGC 145 8 0 

AGGCTCTATC ATCCAAACCT GCAAAAGGTT CATCTAGCAA GATCACGGAA GCCTTACTGG 14 640 

TCAAGATGGT CAGGAGCTGA AGAATTTTTT GCTGACCACC ACTTAATTGA TAGGGACTCT 14700 

TATCGACTGC CTGCTCCAAA TCAAAATATC GTAAAGCTTG AAAAATCCGC TGATTTCTTT 147 60 

CAGAATCAGG TCCATCTAAT TGAAGCTCCT CTCGCAGACT G AC T C G GAT A AACTGCTTCT 14 82 0 

CAGCTTCCTG AACAACACCA GTCAGATCAC GATACAAACT CTTTTTCTTT TTCAGGACCG 148 8 0 

AACCCTTCCA AGTAATGCTC CCCTTATACT TTTGAAATTG AAGAATAGAC CGAAAGAGGG 14940 

TTGATTTCCC G AC AC C AT TG TCACCCAGGA TACAGGAAAT C C CT T GAT AG AATGTGAAAT 15 000 

CAGCAATTGA AAAGAGGGGG CGATTACCAA GCTCACCAGT CACACGGTTC ATATGGAATA 15060 

GTTCCGGGCT AGAAGCAACT TCCTTTGAAG CAACCTGTGT CATCTCATAG GAAGGGATTT 1512 0 

GAAACACTTC CCTTAGTTTT CCGTCTCTTA GCTCCACCAT ATGGTCGATA TAGGCTTTAT 15180 

AGTCAGATAA ATCATGGTCG CACAAAATAA CTGTCTTCCC ATCATAGACC AACTCTTTTA 1524 0 

GAATCTCCAA TATCTCGATT CTGCTCTTGC GGTCAATGGA AGCGAAGGGC T CAT C C AAG A 153 00 

GAT AG AC C CT AGG AT T CAT G GCAAAGAGGA CAGCCAGCGC TGCTTTTTGC TTTTCCCCAC 15360 

CTGATAAGTG ATGGATGAGA CGGTGCAAGA TGTCCTTGCA ACGACATTGC TGGACAACCT 15420 

CTGCTATTTT AGAATCAATT TCCTGAAGGT G ATAGCC G AT ATTTTCCATG GTAAAAACCA 154 80 

ACTCCTCAAA CAAGCTCTCC ATGGTAAATT GATGATTAGG ATTTTGCAAG AGAATACCAA 15 54 0 

CCGTCTGGAC ACGTTCGACG AT AG AAAGC T GACTGACCTC GCTCCCATCT ATCAGGACTT 15 6 00 

G AC CGCT AT A GGGAAGAGAA CTAACTTGGG CAATCATTTG AAAGAGGCTG GATTTTCCAG 15660 

ACCCACTACT CCCAACTAAC AAGGTAAAGG CTTGCGCATG AAAAGTAAAA TCAAACGGCT 157 20 
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CAGAGAAGAT TGGGGACTGA ATCGCTCGTA GTTCCAGACC CATCTATGCT TTTCCTCCAG 157 80 

TTGCAAACTG ATGATAGAGT TTGACAATGG CACGAACCAA GATGGTACAG AAGAAATAAA 15 840 

CAGAAATAAA ACGT AC C AC A AGCAAGGAAA GGACAAACGG AAGGGAAAAG GCGTAGTAAC 15900 

CTAACTTAAT GTATTCATAG ACAAAGCTAA CAAGCGTAAT CCCAATACTA TTAGCAGTTA 159 60 

G AG AGAGC C A ACTTTCATAG CGATTCTTAG TTACGATAAA ACCAAATTCA CTTCCCAAAC 16 020 

CTTGAACAAA GCCAGACAAA AGAGCTCCTA GACCAAATTG GCTACCATAA AGGACTTCAG 16080 

CAAGCGCAGC TAGCACTTCT CCAATCGTTG CACTTCCGAC TCTCGGAACA AAGATGGCAG 16140 

CAATGGGCGC AGCCATACAC CAGAGACCGA AGAGGATTTC ATTGGCAAAG GCCTGCAAAC 16200 

CAAGAGGTGT TAAGAGTAGA CTGAGAATAT TATACACATA TCCTGAACCA ACGAAAACCC 162 60 

CACCAAAAAA GATAGACAAG AAAGCAAGCA AGATAACATC TTTTAACTGC CATTTTTTCA 16320 

ACATAAAAAA CTCCTTTTTT TAAAGAAAAG TGAGGCACTC AAGAAGACCG ACCTAAATAC 16380 

TTTGTATAGC AGACTGAATT TAGAACAGTA CACAAGAACA CTAAAATATT TCTAGAAATT 16440 

AATTTGAATT TTCTAATTGA TTTGTTCGCA TCTTATTTCA AT C T AC TAT A TCATCTTCAT 165 00 

CCAGTTTCGT AAAAGAAAAA ACTCTAATTA CAGATACAAA TTAGAGTTCA GCTTACAAGA 165 60 

TTAGACAGTT CTTTTCGACA TACGAAAAAA ACATTTCACA TTTCCCTTCG CCAGTCTTAA 16 620 

CTGTATCAGG TTCAATGGGT ATCATCTCAG CCTAAAGCAC CCCAAATGTC TTTATTATTT 16680 

AATTATGTGA TTATTATAAC ACACATTTTA TACTAGTTCA AGAAATTGAA CTGGAAATAC 1674 0 

AGCCTTGCAC TCACAAAGAC AGCAGATCTT TCTTTTGCAA AAAACAAATG ACCTGTTTGA 16800 

TGAATTAGCC ATTCAAGCTG AAT CTGG AC A TAGCTTTTTA AAAAAGGAAA ATCCTACTTA 168 60 

CTTAGAATCC AAGGATAGAT AT C T ATTGTT CACTCATTTC CCGAACAGTT TTTTCTATAT 16 920 

TTTTTGCATA CGATATTGCC GAAATGATTG AAACGCCATC CATATTGGTC TTTATAATGT 16980 

CTTTAATATG TTTCGTCTGT ATCCCACCAA TTGCAACTAA AGGCATTTGT GGCAATAGTT 17 040 

TTCTCATCAA TTCAAGACCT TCATAACCTA TAGTACCACC AGCATCATCC TTTGACTGGG 17100 

T AC C AAAT AC AGGCCCAACA CCTACATAAT CTACATATTC AACTTTTGAT TGTTGAAATT 17160 

CTTCTTCGTT TCTTATAGAA AGACCAATTA TTTTATCTGG CATCAATTTT CTAATTTCAT 17220 

CAACACCAAT ATCATCTTGA CCTACATGTA CGCCATCGGC GTCAATTTCC ATTGCTAAAT 172 80 

CTATATCGTC ATTAACGATA AATGGAACAT TGTATTTTTT ACAAAGTTCT T T AATTTGG A 173 4 0 

TAGCTAGCTC AAGTTTTTCT AAGCCTTCTA AAGCACCCTC ACCTTTTTCT CGAAATTGAA 17400 

ATAAGGTTAT ACCACCTTTT AAGGCTTCCT CAACGACTGT ATATAGATTT TTTCCTTGGC 174 60 

AAGTAGTCGT T C C AC AAAT A AAATATAGTT TTAGTAATTC TTTATGAAAC ATCTTACTTC 17520 
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ACTCTTTTGA ATTCCTTTAC ATCTTCATCT GTAATCTCGT ATAAGGCATT TATAAATTCA 17580 

ACTTTAAATG TCCCAGGAAG ATGTCCATTT GGACGTTTTT CTGCTATTTC TCCAGCGATA 17 640 

TTGTAAACCA ACACTGCTGT TTTTAATGAT TTCAATTCTT GACCTTTTTC TAGTCCGATA 177 00 

AAGCTTGCTA CTACAGCTCC TAATAAGCAT CCTGTCCCAA TGACTTTCGG CATCATAGCA 177 60 

CT AC C ATT AT GAATCATTAC CACTTCTCCA TTAACAGCAA TGGCATCCAC TTCACCTGTT 17 82 0 

ACTACTATTG GAATATTGAA CTTCTCATTT G CT G CT AG AG CAATTTCGTC AATATTATCT 17 880 

ACGCCCGCAC TATCTACTCC TTTAGATGCC ACATCTATTC CTACTAAAGA GGCAATCTCG 17 940 

CCAGCATTTC CTCTAATCGC TGCTAGTTTA TAATTGTTGA TT AG AT CAT C TGCTACTTTT 18000 

TTTCTATATT CTCCTGCTCC ACAGGCTACA GGATCTAAAA CTGCTGGGAC ATTATATTTC 18060 

TCTGCAATTT TCAGAGCAGC TTGGTATAAT TTCCAATTTT CATCTGTCAA TGTTCCTATG 1812 0 

TTTATTAATA AACCACCAGC ATACTTTAAC AAATCCTCTA AATCTGCTGG AAACTCACTC 18180 

ATGGCTGGTG AGGCGCCCAG T GCT ACT AAT CCATTTGCTG TGAAATTTTT TACTACATCA 182 40 

TTGGT T AT AC AAATGACCAA TGGTGCTTTT TCTTTTAATA ATTTTAAACT TGTCATATTG 18300 

AAATCCTTCC TTTTCACTTT ATACGATCTA CTAATTTCGA TTTATCTTTA GTTGAGAATT 183 60 

TTTTTCATTT ACATTGAATG ATTTATACTC AATGAAAATC AAAGAGCAAA CTAGGAGGCT 18420 

AACCG C AGGT TGCTCAAAAC ACTGTTTTGA GGTTGTGGAT AGAACTGACG TGGTTTGAAG 18480 

AGATTTTCGA AGAGTCTTAC CTCATCAAAT TTGTAAATAT CATGAGC CTT CTCTAGACAT 18 540 

CGTAACCAAT ATCAAAAAAA GCTAATTCTA AAGCGACTGC TTGATTCCAG CGTTGCTGAA 18600 

GTTCTGTCAA ATCTTCTCGA TTTTTACCGA CACGATTGAG TTCGTCAACC AGAAATTGAA 18660 

CCCACTCTGC AAAGAAAGGA CCTCTGTGGA GATTGATCCA TTCCGAATGA ATATAGACTT 187 2 0 

CAGGTAAAGC CAAATCTTTA GAACCCCAGT CTAAATAGAG ACCTTCTGCA ATGACCAGCA 187 80 

TGACCAAAAG ATGGGCATAG TCTGATGAAG CCACCGCCGA ATACATTAGA TCCTGAAAGG 18840 

CTTTTGTTAC AGGGTGCAAA GTCACTTCTA GATAGTCATT CTCTGCTACT TTTAACTCTT 189 00 

TAAAAGCCTT TTGGAAATAA CCATCTTCAT CTGCTTCAAG AAAGCCTAGT TGCTTGGCAA 189 60 

AACGAAGCTT GGATTCAAGT TTATCTGCGT GACTACGCAG GCACCCAGCA TGGATAAGAA 19020 

GGCATCAAAG AAGTGATAAT CTTGAATCAG ATAGTCCTTT AAGAC C TT AT TCTCAATTGT 19 080 

CCCCGCAAAA AGTTCCTTAA C AAAACG AT G ATTGATTGCA GCCTGCCAAT C CTT CT G AC T 19140 

GCTTTTTAAT AATTCTCCAA CAGTCAAACC TGGCTGAAAT GCATAGTCTT GTGTTTCCAT 19200 

ATTTACTTCT CCTCTCTTTA CTTGTTAGTA ATTAATAAAA C AC C AAG AAA TATCAAGCAA 192 60 
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AATCGTAATT C C ACT TG AT C CTTTTAAAGC ACATCGAGAG CATTTGCAGA GAGCTAACTA 19320 

AACAAGCCTA TCCAGTTTAT ATAAACAAAA AACTCCAATT ACAATCAAGA ATTAGAGTTG 19 3 80 

ACTTACAAGA TTAGACCGTT CATTTCACCA TACGAAAAAA CTGTTCACAT TTCCCTTCGC 1944 0 

CAGTCTTAAC TGTATCAGGT TCAATGGGTA TTATCTCAGC CTAAAGCACC CCAAATGTCT 19500 

CTATTATTTA ACTACTGAAC C AGT AT AG C A AAAAATGAAA GCCCTAGCAA GATATTTGAC 195 60 

C G AAAAAT AT CTTTATATAT AATATATTGA AACTAGAATA GTACACCTCT ACTTATAAAA 19 620 

CATTGTTAGA AATCGATTTG ACTGTCCTGA TTGATTTGTC CTATTCTTAT TTCATTTTAC 19680 

TATAGTTTTC GATAGCAATT TATTCTTCCA ATACACGAAG AAAAACCTCC AC ATT C AGT G 1974 0 

GAGGCAATCT GTTTTATCAA TACAATTTTA AGTCACGAGG GTCAACTGGG AAGGTTGGGT 19800 

TGTATGGATT GTGACGGAGC TTGAAGTGTT TG AC AT C T T C AATGGTCTGA GT T C C AG AC A 19 8 60 

ATTGCATAAC TGTCTTCAAT TCCGCATTCA AGTGTTCAAA GACTTGACGC ACACCGACAC 19920 

TACCACCGAG AGCCAAGCCA TAGATGACAG GGCGTCCAAT AGCAACCAAG TCTGCTCCTG 19 9 80 

ATGCCAAGGC TTTAAAGACG TGTTGACCAC GACGAACACC AGAGTCAAAG ACAATCGGCA 2 0040 

CACGTCTATC AACTGCTTCT GCCACTTCTT GAAGCGAGTC AAAGGCAGCT GGTCCACCGT 2 0100 

CGATTTGACG ACCACCGTGG TTGGTTACCC AGATACCAGA AGCTCCTGCA GCAAGCGAAC 20160 

GTTCAACGTC CTCACGGCAT TGTGGTCCCT TGACATACAC AGGAAGACCA G AGT ATT C AG 2 0220 

CGATAAATTC TACATCGCGT GGAGACAAGC GTTGTTTAGC TGATTTGTAA ACAAAGTCCA 2 0280 

TTGATTTACC AGCACCTTCT GGCAGGTATT CTTCAACAAT CGGCATGCCA ACTGGGAAGA 2034 0 

CAAAACCATT ACGCTT AT CC ACTTCACGAT TCCCCCCTAC AGT AG C AT CT GCCGTCAAGA 20400 

CAATCGCTTT ATAACCTTCA GCCTTCACAC GGTCCATGAT GTGGCGGTTG AT AC CGT C AT 2 0460 

C CT TACT AAA GTAAAATTGA AACCAATGAG GTGTCCCTTG GAGGGCTTCA GAAATCTCTG 2 052 0 

GAAGGTCAAC AGTAGAGTAA GAACTGGTTG TATAAAGAGA ACCAAACTCA TGCACACCAC 2 0580 

GCGCAGTCGC CACTTCCCCC TGTTCATTTG CCAATTTATG AGCCGCAACA GGTGCCATAA 2 0 640 

TGATTGGAGA AGATAGTTTT TCACCTGCAA ATTCAATCTC TGTACTTGGA T T TT C T AC AT 2 0700 

TGCAAAGTGT ATGAGGAACG ATGAGCTTGT GGTTAAAGGC AC GG AT ATT C TCTCTTAAAG 2 0760 

TGAAAGTATC TTCCGCCCCA CTAGCGATAT AGCCAAATGC TGCTTTAGGA ATAACTTGTT 20820 

GCGCCATTGG CTCCAAATCA TAGGTATTGA TGAArTCTAC ATGACCTTCT GCATTGCTTG 20880 

TTTTGTATGA CATAAAATGT CCTCCTTAAT AAGTAAGCGT TTACTTTGTG TATTACAAAA 2 0 940 

ATATCTTAAC TCTTTTTCAA AACTTTTAAA ATATTTTGTT TGGAAATTTC AGAAATTTTA 21000 

TGTCTATGAT AAAAATCCTT ATAACGGCAA TAAAAAATAG ATATTATCCA AAGAAGATTT 210 60 
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TAAGTGCTAC AATAACTGTA TTATTTCTAG ATGGGAGGTT CTATTTTTGG ATTGATCCAT 21120 

TGTTGAACAA TATCTACCAC TATATCAAAA GGCATTCTTT CTGACCTTGC ATATTGCAGT 21180 

TTGGGGAATT TTGGGATCCT TTCTGCTCGG TTTAATCGTT AGTATCATCC G AC ATT AT C G 21240 

AATCCTTGTT TTGGCGCAAG T AG CG AC AG C CTACATTGAA TTGTCACGTA ATACGCCCCT 213 00 

TTTGATTCAA CTCTTCTTTC TCTACTTCGG TCTTCCCCGA ATCGGGATTG TCCTATCTTC 213 6 0 

AGAAGTCTGT GCAACGCTTG GGCTTGTCTT TTTAGGAGGC TCCTATATGG CAGAATCTTT 2142 0 

C CG AAGTGGG CTGGAAGCCA TCAGTCAAAC CCAGCAGGAG ATTGGCCTCG CTATTGGTCT 214 80 

GACACCTCTA CAGGTCTTTT ACTATGTGGT TCTTCCGCAA GCAACAGCGG TGGCACTCCC 2154 0 

CTCCTTTAGT GCCAATGTCA TTTTCCTTAT CAAGGAAACC TCTGTTTTCT CAGCAGTGGC 21600 

TTTGGCCGAC CTCATGTACG TCGCCAAGGA TTTGATTGGT CTCTACTATG AGACAGACAT 216 60 

TGCGCTAGCT ATGTTGGTAG TTGCTTATCT AATCATGCTG CTACCCATCT CACTGGTCTT 2172 0 

TAGCTGGATA GAAAGGAGGC TCCGCCATGC AGGATTCGGG AATCCAAGTA CTCTTTCAAG 21780 

GAAATAATCT CCTGAGAATC TTACAGGGAT TGGGCGTTAC GAT TGGG AT A TCCATCCTGT 21840 

CTGTCCTCTT ATCCATGATG TTCAGAACAG TCATGGGAAT CATCATGACC TCCCATTCTA 219 00 

GAATCATACG ATTTTTAACA CGATTGTATC TGGAATTTAT CCGTATCATG CCCCAGCTGG 219 60 

TGCTACTCTT CATCGTTTAC TTTGGCTTGG CTCGAAACTT TAATATCAAT ATCTCAGGTG 22 02 0 

AGACTTCAGC TATTATCGTT TTTACCCTCT GGGGAACAGC TGAAATGGGA GACTTGGTAC 2 2 080 

GTGGAGCTAT CACTTCTCTC CCTAAACATC AGTTTGAAAG TGGACAGGCA CTCGGCTTGA 2214 0 

CTAATGTTCA ACTTT AC T AC CACATCATCA TCCCACAAGT CTTAAGAAGA CTGCTACCGC 22 2 00 

AGGCTATCAA TCTTGTCACT CGGATGATTA AAACCACTTC ATTAGTTGTT TTGATTGGGG 2 22 60 

TTGTGGAAGT GACCAAAGTT GGACAACAAA TCATCGATAG CAATCGCCTG ACCATCCCAA 2 2320 

CTGCTTCATT TTGGATTTAT GGAACCATTC TAATCTTATA TTTCGCAGTT TGCTACCCTA 22 3 80 

TTTCCAAACT ATCCACTCAC T T AG AAAAAC ATTGGAGAAA CTAAATGTCT GAAACTATCT 22 440 

TAG AAAT C AA GGAACTAAAA AAATCCTTCG GAGACAATCC CATCCTCCAA GG ACTTT C TC 22 500 

TAGAAATCAA AAAAGGGGAA GTTGTTGTCA TCCTAGGGCC ATCTGGTTGT GGGAAAAGTA 2 2 560 

CCCTCCTTCG TTGCCTCAAC GGCTTAGAAA GTATTCAAGG TGGAGATATT CTTCTGGATG 22 620 

GTCAGTCTAT CGTTGAAAAT AAAAAAGATT TTCACCTAGT TCGCCAAAAG ATTGGCATGG 22 680 

TCTTTCAAAG TTATGAACTC TTTCCCCATC TGGATGTCTT ACAAAACCTC ATCCTAGGCC 22740 

CTATCAAAGC TCAAGGAAGG GACAAGAAAG AAGTAACGGA AGAAGCTTTG CAATTACTAG 22 800 



WO 98/18931 



PCT/US97/19588 



444 

AGCGTGTCGG TTTGCTGGAT AAACAACATA GCTTTGCCCG TCAATTATCT GGTGGACAGA 22 860 

AGCAACGTGT TGCAATTGTC CGTGCCCTCC TAATGCATCC AG AAATC AT C CTTTTTGACG 22920 

AGGTGACTGC TTCGCTGGAT CCAGAAATGG TGCGTGAGGT GCTGGAACTT ATCAATGATT 22 980 

TGGCCCAAGA AGGCCGTACC ATGATTTTAG TAACCCACGA AATGCAGTTT GCCCAAGCCA 2 3 040 

TTACTGACCG GATTATCTTC CTCGACCAAG GGAAAATCGC T G AAGAAGG A ACAGCTCAAG 23100 

CCTTCTTTAC CAATCCGCAA ACCAAACGAG CCCAGGAATT TTTAAACGTC TTTGACTTTA 23160 

GCCAATTCGG CTCATATCTA TAAAGGAGAT T CTT ATG AAA CTATTCAAAC CACTCTTAAC 2 3220 

TGTTTTAGCA CTTGCCTTTG CCCTTATCTT TATCACTGCT TGTAGCTCAG GTGGAAACGC 232 80 

TGGTTCATCC TCTGGAAAAA CAACTGCCAA AGCTCGCACT AT C G ATG AAA TCAAAAAAAG 23 340 

CGGTGAACTG CGAATCGCCG TGTTTGGAGA TAAAAAACCG TTTGGCTACG TTGACAATGA 23400 

TGGTTCTTAC CAAGGCTACG CTACGATATT GAACTAGGGA ACCAACTAGC TCAAGACCTT 234 60 

GGTGTCAAGG TTAAATACAT TTCAGTCGAT GCTGCCAACC GTGCGGAATA CTTGATTTCA 2 3 520 

AACAAGGTAG ATATTACTCT TGCTAACTTT ACAGTAACTG ACGAACGTAA GAAACAAGTT 2 3 580 

GATTTTGCCC TTCCATATAT GAAAGTTTCT CTGGGTGTCG T AT C AC C T AA GACTGGTCTC 2 3 640 

ATTACAGACG TCAAACAACT TGAAGGTAAA ACCTTAATTG TCACAAAAGG AACGACTGCT 23700 

GAGACTTATT TTGAAAAGAA TCATCCAGAA ATCAAACTCC AAAAATACGA CCAATACAGT 237 60 

GACTCTTACC AAGCTCTTCT TGACGGACGT GGAGATGCCT TTTCAACTGA CAATACGGAA 2 3 820 

GTTCTAGCTT GGGCGCTTGA AAATAAAGGA TTTGAAGTAG GAATTACTTC CCTCGGTGAT 238 8 0 

CCCGATACCA TTGCGGCAGC AGTTCAAAAA GGCAACCAAG AATTGCTAGA CTTCATCAAT 2 3 940 

AAAGATATTG AAAAATTAGG CAAGGAAAAC TTCTTCCACA AGGCCTATGA AAAGACACTT 2 4000 

CACCCAACCT ACGGTGACGC TGCTAAAGCA GATGACCTGG TTGTTGAAGG TGGAAAAGTT 24060 

GATTAGTCAT TAACTCTTAA AAGGAACTGG ATTTTAAGCT CCAATCCCTT TTTAAGATTT 2412 0 

TACCTATAAC ATCCTGAGTC TATCTAAGAT GTTCAATCTG AACACAGTGT ACATACTTTA 24180 

TCTTCTATTG CAT AT AC T TT AT C AC AT AAG ATACGAATAT CCTCTTCACT ATGACTAGCA 2 4240 

ATCAAAATTG TTGTCCCTTT TTCACTAGAG AGCTTTCTAA ACAATGTTCT CATATTTTCT 243 00 

ACACTTGATT TATCCAAGGC ATTCATAGGT TCATCTAGTA AAAGAATAGA GGGATTCTCC 243 60 

ATAATTGCTT GAGCAATCCC TAGCTTTTTC CTCATACCTA GCGAATAAGT TTTAACTTTC 24420 

TGGTCTTTTT GC T CAT AT AG ACCAACTATT TTCAGTGTAT CATTGATTTC CTGATTACCA 2 4480 

ACTACTCCTC GTATGCTTGC CAAATATTGT AAATTCTTAA AGCC AC TATA ATAATTTATA 24 540 

AAACCAGGTT CTTCAATCAA AGCTCCCAAA TTAGCTGGAA TTTTTCTCTC AGGAACAATA 24 600 
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TTTTCCCCAT TGATTAACAC TTCTCCATAA GACGGACTAT ATAAACCAGC TATTAATTTA 



24660 



AACAATACAC TTTTCCCTGA GCCATTCGCA CCAGTAATTC CTATAATTTC CCCCTGTTTA 



24720 



CAACTAAAGT TAAGGTTTTG AAAAACACAT GTCTTTTTTA ATTTCAACTC AATATTTTTT 



24780 



AATGTAATTA TTTC ATT CAT TCTATAAACC TCCTCTTTTG ACGAGTGAAA TAGAAAATGC 



24840 



TTTGAAAAAG AAAGACTAAA AATAGCAACT GAAGAAATAA ATCTCGTCCT ATATCTCCAT 



24900 



TCCCTCGATT CAAAATATAA AATAGATAAT TAGTTCGATT TCCTACAAAT AGACCACCAA 



24960 



ACACAATCAT GAGTAAAAAG AAACTAACGC AAGCAAAGTT CG 



25002 



(2) INFORMATION FOR SEQ ID NO : 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11443 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

CAGGTACGGT GAGGCGCAAC TAAAATATAA TTTTCATCTT GATTAGGAAT TTTATCAGTA 60 

TTATGATAGT GAGCATTGCC ATTGATGGAC C AT AAG AG C A ATACAACTAA TCCACGCAAA 120 

TAAGTATAAA ACATGCGATC TCCTTCGATT GTTTTCTTGT TATTATTATA CCTTATCAAA 180 

GGAGGGCTGG C AAACTTT T C CCTTGACTAG ATACATATTT AGGATGAAAT TAGAATTCTG 24 0 

TTAAAAAAAA TGATATAATA GAATTTATGG ATAAAAATAA GATTATGGGA TTAACCCAAA 3 00 

GAGAAGTCAA GGAAAGACAG GCTGAGGGTT TGGTCAATGA CTTTACCGCA TCAGCCAGTA 3 60 

CCAGCACTTG GCAAATCGTT AAACGAAATG TCTTTACCCT TTTTAACGCT TTGAACTTTG 42 0 

CCATTGCTTT GGCTCTTGCC TTTGTGCAGG CTTGGAGCAA TCTGGTCTTC TTTGCTGTTA 480 

TCTGCTTTAA CGCTTTTTCT GGGATTGTGA CCGAGCTACG AGCCAAACAC ATGGTGGACA 540 

AGCTCAATCT CATGACCAAG G AAAAGGT C A AAACCATCCG TGATGGTCAG GAAGTTGCTC 600 

TTAATCCTGA AGAATTAGTG CTAGGAGATG TCATTCGTTT GTCTGCAGGA GAGCAGATTC 660 

CTAGTGATGC CTTGGTTTTG GAAGGCTTTG CGGAAGTCAA TGAAGCCATG TTAACGGGAG 72 0 

AAAGTGATTT GGTGCAAAAG GAAGTTGACG GCTTACTTTT GTCAGGAAGT TTCCTAGCCA 7 80 

GTGGGTCAGT TTTATCTCAA GTTCACCATG TCGGTGCAGA CAACTATGCT GCCAAACTCA 840 

TGCTTGAGGC TAAGACCGTT AAACCCATCA ACTCCCGTAT CATGAAATCG CTGGACAAGT 900 

TGGCTGGTTT TACTGGGAAG ATTATCATTC CCTTTGGTCT GGCTCTCTTG CTGGAAGCCT 960 
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TGCTTTTAAA AGGCCTGCCT CTCAAGTCAT CCGTTGTAAA CTCGTCGACA GCTCTTTTGG 102 0 

GAATGTTGCC TAAGGGAATT GCCCTTTTGA CCATTACTTC GCTCTTGACT GCAGTGATTA 1080 

AGTTGGGCTT GAAAAAGGTC TTGGTGCAGG AGATGTACTC TGTTGAGACC TTGGCGCGCG 1140 

TGGATATGCT CTGTCTGGAC AAGACGGGTA CCATCACCCA AGGAAAGATG CAGGTGGAGG 12 0 0 

CTGTTCTTCC GTTGACGGAA ACGTATGGTG AAGAGGCTAT TGCCAGCATC TTGACTAGCT 12 60 

AC ATGGC CC A TAGTGAGGAT AAGAATCCAA CTGCCCAAGC CATTCGCCAG CGTTTTGTGG 132 0 

GAGATGTTGC TTATCCTATG ATTTCCAATC TTCCCTTCTC GAGCGACCGC AAGTGGGGGG 1380 

CTATGGAGTT AGAAGGCTTG GGGACAGTTT TCTTAGGGGC ACCTGAGATG TTGCTTGATT 1440 

CTGAAGTCCC AGAAGCTAGG GAGGCCTTGG AGAGAGGATC ACGTGTCTTG GTCTTAGCTC 1500 

TCAGTCAGGA GAAATTAGAC CATCACAAAC C AC AG AAAC C ATCTGATATT CAGGCTCTAG 15 60 

CCTTGCTGGA AATCTTGGAC CCCATTCGAG AGGGAGCAGC AGAGACGCTG GACTATCTCC 162 0 

GTTCTCAGGA GGTGGGACTC AAGATTATCT CTGGTGACAA TCCAGTTACG GTGTCCAGCA 168 0 

TTGCCCAGAA GGCTGGTTTT GCGGACTATC ACAGCTATGT AGATTGCTCA AAAATCACCG 17 40 

ATGAGGAATT GATGGCCATG GCGGAGGAGA CAGCTATTTT CGGACGTGTT TCCCCTCATC 1800 

AAAAGAAACT CATCATCCAA ACGTTGAAAA AAGCGGGACA TACAACGGCT ATGACAGGGG 18 60 

ACGGGGTTAA TGATATCTTG GCCCTTCGTG AGGCGGATTG TTCTATCGTG ATGGCGGAGG 192 0 

GGGATCCAGC AACCCGTCAG ATTGCCAATC TGGTTCTCTT G AACT C AG AC TTTAATGATG 1980 

TTCCTGAGAT TCTCTTCGAG GGTCGTCGCG TGGTCAATAA CATTGCCCAC ATCGCCCCGA 2040 

TTTTCTTGAT AAAGACCATC TATTCCTTCC TGTTAGCAGT CATCTGTATT GCCAGTGCTT 2100 

TACTAGGTCG GTCAGAGTGG ATTTTGATTT TCCCCTTCAT TCCGATCCAG ATTACCATGA 2160 

TTGACCAGTT TGTGGAAGGT TTCCCACCAT TCGTTCTGAC TTTTGAGCGA AAT AT C AAAC 222 0 

CTGTTGAGCA GAATTTCCTC AGAAAATCCA TGCTTCGTGC CCTACCAAGC GCTCTCATGG 2280 

TCGTCTTCAG CGTCCTGTTT GTGAAAATGT TTGGCGCGAG TCAAGGTTGG TCTGAGTTAG 2340 

AAATCTCAAC TCTACTCTAT TATCTCTTGG GGTCAATTGG TTTCTTATCC GTATTTAGAG 24 00 

CCTGCATGCC ATTTACCCTA TGGCGTGTCC TCTTGATTGT TTGGTCAGTA GGAGGTTTCC 2460 

TAGCCACAGC TCTCTTCCCA AGAATTCAAA AACTGCTTGA AATTTCAACC TTAACAGAAC 2520 

AAACGTTGCC TGTTTATGGT GTCATGATGT TGGTCTTTAC CGTGATTTTC ATCCTGACCA 2580 

GTCGTTACCA AGCGAAAAAA TAAATCAAAA CCACCAGTGT GAACTGGTGG TTTGTTCTGC 2 640 

GGCTATAAGC CGCTTCTACC GGCCAGGGCC AAAGGCCCAC CGAAATAGCT TCCTCGCGCA 2 7 00 

CCACTTTCCC GAGCAGGTGC TAAAGCACCT TAGTTACTTC CTCTTATTTA TTTCGCCAGT 27 60 
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AAACGGATCT AC TG AC T C G A ATAACGTGAG CTGGTCTGCT ACTCTGTCTT CTTGTAATTG 2 82 0 

ATTCTGAATA TATTCAGCTA TCACTTTCTG ATTACGGCCT ACCGTATCTA CATAATAGCC 2 880 

T C T AC AC C AA AACTTGCGAT TGCCATATTT GTATTTTAAA TTCGCATGCT TAT C AAAAAT 2 94 0 

CATCAAACTG CTCTTGCCCT TTAAATAGCC CATAAAGGAC GAAACACTAA GTTTCGGAGG 3 000 

AATACTGATA AGCATGTGAA TATGGTCTGA ACAAGCATTC GCTTCATGGA TTATTACACC 3060 

CTTACGCTCA CATAAGTCAC GTATGATTCT T C CG AT AC T A GCTTTGTATC TGCCATAAAT 312 0 

GATTTGACGA CGATATTTGG GTGCAAAAAC AAT AT GAT AT TTACAATTCC ATGTGGTATG 3180 

TG AT AAACT T TGATTATCCT CTCTCATGAG GTACCTCCTG TATGATATGT TGTAGTGGCG 3 24 0 

GAGAAACCAC TTCTATCTTA TCATTTTAGG AGGTTCTTTT TGTTACCACG CTAAAAGCTC 33 00 

TATGGAAcCA CTAGCATAGC TAGTGGTTTT CGGGAGACAA CAAGAAAGAC TGCAATCTGT 33 60 

GGATTGCAGT TTTTTATACG ATGGATCTAT CGTAGATCTG ATGTGCAAGG CCTACGTGCC 342 0 

GATCATCTAT CGGTGAACCC AAGAGCGACC CTCAAGCCTG CTTGGATTGA GGTAATAGAT 3 4 80 

TCAAATATCT GTAGTTAGAC TATTTGAAGT TTGATGTAAG AAAGAGAAAG CGACAGATTG 3 54 0 

AAGTAATTTT AACTCTCTTC TATTGCTAGA ACAAATGGTC GGATAGGTTG GTAGTTTGAA 3 600 

AATGAAGATG C T AT CT ATT G TTAAATGGAA CATAGTGTTA TTTATTAGAA AATCGTTTGG 3 66 0 

TTTATTTCTT ATCAAATACG AAAAGCAACT TAAATATTTC AACTAAAATA GATGTTATGA 3 72 0 

AGAAAAGGTA AAATGATTTT GGCATAGTGA GGTTCTGTTC TATTTGATAT CATATTTTTG 3780 

ATAAAAACAA AAATGTCCAT TGCAAAGGAC AAAATGCGAA GT AT ATT AT T TTTTGAAAGC 3 840 

GATATAATGG ATT C ATAAAG GAGGTGTATC GTGTCTAGAA AACAAGAACA AATGGAAACG 3900 

TTGTTGCTCC TTTTGCGAGA TAGTAAGGAT TAT AT AT C T G CTAAAGTATT GGGAGAAAAA 39 60 

TTAAATTGCT CTGATAAAAC GGTTTATCGC CTTGTCAAGG GAATCAACAA AG AT T GTCCG 4020 

GTAGAAGCAT TCATTTTATC TGAAAAAGGC AGAGGTTTCA AATTAAATCC AAGAAGTTCC 4080 

CTCGTGGACG TTGATGGGAA TTTTACAGAG GCTTTTGATC CTGAAGTAAG GCGTGAAAAA 4140 

TTACTAGAAC GTCTCTTGTT GACTGCTCCT AAGC C AC AT T CTATTTATGA TTTAGGAGAG 42 00 

GAATTCTACG TAAGCGAGTC AGTAGTACTA AAAGATCGTC AGATATTACA AGAGAGTCTA 42 60 

GCAATTTATG GGT TAG ATT T AAAAATGAGA CAACGAAAGC TTTTTATTGA TGGGGATGAG 432 0 

GCTCAAATTC GTTCAGCCAT TCTAAATCTA CTGCCAATGT TTAATCAGTT GGATTTAGAG 4380 

CAAATTACAC AGAATAAGGT TCAGCCTCTT GACGGAGAAC TTGCTCACTT TTGTTTGGGA 4440 

TTACTGATTA CACTTGAGAG AGAATTGGGG GTAAACATTC CCTATCCATA TAATATAAAT 4500 
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ATTTTCTCTC ACCTGTATAT TTTTATCAGT AGGAATCGTC G T AGT ACT AG TATTCATGTT 4 5 60 

GTAGCACCTT CAAAACCTAC TATTGTTGAT GAGAAAATTT ACAGTGTCTG TCAAAAAATT 4 62 0 

ATTCAAGAAA TTGAACAATA TTTTAGGATG AAGGTTGATG CAGTTGAGAT TGACTATCTT 4680 

TATCAATACG TTGTATCTTC GAGATTGCAA AAACCATTTT CTTCCGGGAA GCTTCCTTTT 4740 

TCTCAGCGAG TTTTAGATGT C ACT CAT T AC TATTTTAGCC GTATGTGTAT GGACAATAGA 4800 

GAGATTGAAA CGACAGATCC TGACTTTGTT GACTTGGCGA GTCATATCAG TCCCTTACTG 4860 

AGGAGATTAG ATAATAGAGT ACAGATTAAG AATAGTCTTT TATCACAAAT TCTTTTAACC 492 0 

TATCCTAATC TGGTTAAAGA GTTAACAACT ATTTCTAAAG AAGTGAGTCT AGTATTTGGT 4980 

TTTGCTTCCT TGAGTCTGGA CGAGATTGGT TTTCTAGTCT TAT AT T T T GC ACGGTTTCAA 5040 

GAAAAGCGAG CACGTCCTCT AAAAACAGTA GTGATGTGTA CATCAGGTGT CGGAACTTCA 5100 

GAGCTTTTAC GAGCACGATT AGAAAAGCAA TTTTCTGAAT TGGATATTAT TGATGTAGTT 5160 

GCTTATCATC AATTAGATGA GCTGATAAAT CTATATCCAG ATTTAGATTT CATTGTGACG 5220 

ACGGTAGCTT TGCAGGAACC AGC AAGTGT C CCGTTTGTCC TAGTTAGTGT TTTTCTAACC 52 80 

GAGGGTGATA AACAACGTCT TCAAGCAAAA ATTCAGGAGA TAAACTATGA ATAATCTTTC 5340 

GCTTGTCCTT ATGGATATAT CTGTTCAAAA TCGTCAAGAA GCCTACAAAG AATTAGCAAA 54-00 

TCAAATCAGC CTTCTTGTTT CTGAAGATAC AGAAAAAATA GAAGAGCTTC TAT AT T AC CG 54 60 

T GAG AG AC AG GGAAGTATAG AGGTTGCTAA AGGTGTTCTT C T AC C AC AT T GTGAAGGAAA 5520 

CTTTCAACAT CATGTCTTAG T GAT TACT AG ATTAAAATCA CCT AT C AG AG AATGGTCGAA 5580 

GGATATCCAG TGTGTTGACC TTATTATCGG TTTGGCCATT GC AGT AT C AC AGGACAAGTC 564 0 

ATGTATTAAA ACATTGATGA G AAG ACT AG C AG ATGAAT C A TTCATAAATC AATTAAAACA 57 00 

GTTAACAAAA GAAGAATTAC GGGAGATAAT ATATGGAAAT CAAAGATATT CTTAATGTGA 57 60 

GTCTGATCCA GACGGATTTA CAGATGCAGA GCAAAGAAGA GGTTTTTGAG GCATTAGCTC 582 0 

AACTATTGGT TGAGACGGGT TATGTGTCTG ATAGAGACCA ATTTATCGAA GGTCTTTATC 5880 

AGAGAGAGGC AGAAGGACAG ACCGGTATTG GGAATTATAT TGCTATTCCC CATAGCAAGA 5940 

GTTCTGCTGT GGAGAAGGCG GGGGTAGTCA TAGCTATAAA TCACAATGAG ATTCCTTGGG 6000 

AGACCATTGA TGGGAAAGGG GTCAAAGTAA TTGTACTCTT TGCAGTTGGT GATGATACAG 6060 

AAGCTGCTAG GGAGCATTTG AAGACCTTAT CACTCTTTGC TCGAAAACTT GGTAATGACG 612 0 

AAGTTGTTGC CAAATTAGTT CGGGCTCAGA CATCTGATGA TGTGATTGCA GCTTTTTGTT 6180 

AATAAGAAAA AATTTTGGAG GGTATCCGTA TGAAAATTGT TGGTGTTGCA GCTTGTACTG 6240 

TGGGAATTGC CCACACTTAT ATTGCACAGG AAAAATTAGA GAATGCCGCA AAGGTAGCTG 63 00 
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GACATGTGAT TCATGTTGAG AC T C AGGGG A CAATAGGGGT AGAAAATGAA TTGAGTCAAG 63 60 

AGCAGATTGA TGCAGCGGAT GTAGTTATTT TAGCAGTTGA TGTTAAGATT TCTGGTATGG 6420 

AACGCTTTGA GGGTAAAAAG ATTATCAAGG TTCCAACAGA AGTGGCAGTC AAATCTCCCA 64 80 

ATAAACTGAT TGCTAAAGCT GTTGAGATTG TTACGAAATA ACTGAAAATA TTTAAGGAGA 654 0 

AAATAT AT GT TGAAACACTT AAACTTAAAA GGTCACTTAT TG AC AGC CAT TTCCTATATG 6 600 

ATTCCAATTG TTTGTGGTGC AGGATTCTTA GTTGCCATTG GTTTAGCAAT GGGGGGTGGT 6660 

GTTCCTGACG CTCTTGTAGC AGGAAAATTC ACTATCTGGG ATGCTTTAGC AACTATGGGT 6720 

GGTAAAGCCC TTGGTCTCTT GCCAGTTGTT ATTGCTACAG GTTTGTCTTA CTCGATTGCT 6780 

GGTAAGCCAG GGATTGCACC AGGTTTTGTT GTTGGTCTAA TTGCCAATTC TGTTGGTTCA 6840 

GGGTTTATCG GTGGTATCTT GGG AG GTT AT ATAGCTGGTT TCTTGGTTCA AGCGATTATT 6900 

AAAAAGGTCA AAGTACCAAA CTGGATTAAA GGTTTAATGC CAACCTTGAT TATTCCTTTT 6960 

GTAGCCTCTT TGGTAAGTAG TTTGATTATG ATTTATATTA TTGGAGCGCC TATCGCAGCC 7 02 0 

TTTACCAACT GGTTGACGAG CTTATTACAA AGCTTGGGAA GTGCTTCAAA TGGTTTGATG 7080 

GGGGCAGTTA TTGGAATTCT CAGTGCTGTT GACTTTGGTG GCCCACTTAA TAAAACAGTC 714 0 

TATGCGTTTG TGTTGACTTT ACAGGCTGAA GGTGTGAAAG AACCATTGAC TGCTTTACAA 72 00 

TTGGTGAATA CTGCTACACC AGTTGGATTT GGATTGGCCT ATTTTATCGC GAAATTACTC 72 60 

AAAAAAAATA TC T AT AC TC A AGAGGAAATC GAAACATTGA AATCGGCTGT TCCTATGGGG 732 0 

ATT GTCAAT A TTGTTGAAGG TGTAATTCCG ATTGTTATGA ATAACTTGGT TCCAGGTCTC 7380 

AT TG C AAC AG GTATCGGTGG TGCTGTTGGT GGTGCTGTTT CTTTGACAAT GGGTGCTGAT 744 0 

TCTGCTGTGC CATTTGGTGG AGTGCTT AT G TTACCAACCA TGACTCGTCC AGTAGCTGGT 7 500 

ATTTGTGCCT TGTTAGCTAA CATTGTAGTC ACAGGACTTG TCTACGCGAT TTTGAAAAAA 7 5 60 

CCAATAAAAC ATGCAGAACC AGTTATGACT GTTGAAGAAG AGATTGATTT GTCAGATATT 7 62 0 

GAAATTTTGT AAGAGGGTAA CGATGTCAAG AATTGAATTT TCACCATCTT TGATGACCAT 7680 

GGATTTGGAC AAATTCAAAG AGCAGATTAC TTTTTTGAAT GATAAAGTAG C ATCTT AT C A 7740 

TATCGATATT ATGGATGGCC ATTTTGTTCC CAATATTACC TTGTCTCCTT GGTTCATTCA 7800 

AGAAGTTCAA AAAATTAGTG ACACACCTTT AT C AGTT C AT CTGATGGTCA CAGACCCAAC 7860 

CTTTTGGGTA GATCAAGTTC TCGATTTACA ATGTGAGTAT ATTTGTATTC ATGCTGAAGT 792 0 

TCTGAATGGT CTTGCTTTTC GTTTGATTGA TAAAATTCAT GATGCAGGTC TAAAGGCTGG 7980 

TGTTGTCCTT AATCCTGAAA CACCTGTTTC TACAATCTTT CCCTACATTG ATTTACTTGA 8040 
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CAAAGCAACT ATTATGACTG TAGATCCAGG TTTTGCAGGA CAACGCTTTT TGGAGTCTAC 8100 

CTTGTATAAA ATCCAAGAAC TCCGTCAGCT TAGAGTTCAG AATGGTTATC ACTACATCAT 8160 

TGAGATGGAT GGTTCTTCGA GTCGTAAGAC TTTCAAACAA ATTGATGTGG CAGGACCAGA 8220 

TATTTATGTT ATAGGTCGCA GTGGATTATT TGGTTTGGAT G AC G AT AT TG CCAAAGCCTG 8280 

GGATATCTGT TCTAGAGATT ACGAAGAAAT GACCGGAAAA ACAATGCCAA TCAAATAATG 8340 

GTTTGAGAAG AAATTTATTA GTTAGGAGGA ATATATGTCA CTACAATCAG T T AACGCC AT 8400 

TCGTTTTCTT GGCGTAGATG CTATTAACAA ATCTAATTCT GGTCACCCGG GAATTGTCAT 8460 

GGGTGCTGCG CCAATGGCTT ATAGCCTATT TACAAAGCAC CTTAGAATTA CACCTGAGCA 8520 

GCCAAACTGG ATTAACCGAG ATCGCTTTAT CTTGTCTGCG GGTCATGGAT CAATGCTACT 8580 

GTATGCTCTC TTGCATTTAA CAGGGTATAA GGATGTATCC ATGGACGAGA TTAAAAATTT 8640 

CCGGCAATGG GG AT CTAAG A CACCTGGTCA TCCTGAAGTG ACGCATACGT CTGGTGTGGA 8700 

TGCGACATCT GGTCCGCTTG GTCAGGGGAT TTCTACTGCC GTTGGTTTCG CCCAAGCAGA 87 60 

GCGTTTTTTA GCTGCTAAGT ACAACAAAGA TGGTTTCCCT ATTTTTGACC ATTATACTTA 882 0 

TGTTATCGCT GGAGACGGTG ACTTCATGGA AGGAGTGTCT GCGGAGGCGG CTTCTTATGC 8880 

AGGTCATCAA GCTT TAGAT A AGCTTATCGT CCTCTACGAC TCCAACGACA TCTGCTTGGA 8940 

TGGTGAGACC AAAGATACTT TCTCTGAAAA TGTTCGCGTC CGTTACGATG CTTATGGTTG 9000 

GCATACAGTT CTGGTAGAAG ATGGAACAGA TTTAGCAGCA ATTTCTACAG CAATTGAGAC 9060 

GGCCAAGTTT TCTGGTAAAC CGAGTTTGAT TGAAGTGAAA ACGGTAATTG GTTACGGCTC 912 0 

AC C C AAT AAA AGTGGTACAA ATGCTGTTCA TGGTGCACCA CTAGGAGCAG AAGAAACAGG 9180 

AGCAACTCGT AAGTTTTTGG GATGGGATTA CG AT C CAT T T GAAGTACCAG AGGAAGTATA 924 0 

TTCTGATTTC AAGACAAATG TAGCGGATCG TGGTCAGGAG GCATACGATG CTTGGGCTAG 9300 

TTTGGTGTCT GATTACAAGG TTGCTTATCC CGAAGTTGCT AGTGAGATTG ACGCTATTGT 93 60 

AGCTGGAAAA TCCCCTGTAA CCATTACTGA AAAAGACTTC CCTGTCTATG AGAATGGCTT 9420 

CTCTCAAGCA ACTCGTAATT CGTCCCAAGA TGCTATTAAT ACAGCAGCAG TTTTACCAAC 9480 

CTTCTTAGGT GGATCGGCAG ACTTAGCTCA CTCTAACATG ACCTACATCA AGGCAGATGG 9540 

CTTACAAGAT AAATATAATC CATTAAACCG C AAT ATT C AG TTTGGGGTAC GTGAATTTGC 9600 

CATGGGAACA ATCCTCAATG GAATGGCTCT TCATGGTGGT TTACGAGTTT ATGGCGGAAC 9660 

CTTCTTTGTT TTCTCTGACT ACGTCAAAGC TGCTATTCGG C TAT C AG CCA TTCAGGAGTT 972 0 

GCCTGTAACT TATGTCTTTA CCCATGATTC AATTGCCGTT GGTGAAGATG GTCCAACTCA 9780 

TGAACCAGTT GAACATTTGG CAGGTTTACG CTCAATGCCA AACTTGACTG TTATCCGTCC 9840 
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AGCGGATGCC CGTGAAACTC AAGCGGCTTG GCATCATGCC TTGACCAGTA CCACCACTCC 9 9 00 

AACTGTCATT GTCTTAACCC GTCAAAACTT GGTAGTTGAA GAAGGGACAG ACTTTGGTAA 9960 

GGTCGCTAAA GGAGCCTACG TCGTGTATGA TACCCCGGGA TT TG AT ACT A TTATCATTGC 10020 

TACAGGATCT GAGGTCAATC TAG C T AT C AA AGCTGCTAAG GAATTGGTTT TACAAGGTGG 10080 

TAAAGTACGT GTGGTATCTA TGCCCTCAAC CGAACTATTT GATGCTCAAG ATGCTACCTA 1014 0 

CAAGGAAGAC ATTTTACCAT CTAAGACTCG TCGTCGTGTG GCCATTGAAA TGGCAGCGAC 10200 

CCAAAGTTGG TACAAGTATG TTGGTTTGGA TGGCGCGGTC ATCGGTATTG ACATCTTCGG 10260 

TGCGTCTGCC CCAGCTCAGA CTGTGATTGA TAATTATGGA TTTACGGTAG AGAATATCGT 1032 0 

TGCTCAAGTT AAGTCCCTAT AGAAACCAAT TACAATGAAG ATACAGCTGT TGTCAGACTA 103 80 

GCAGATGTAG TGATAGACAC TAATCAGATG ATTGGTTATT TAAAAACTGT AATGAAAATG 10440 

TAATAATTTA TCTACGAAAG T T AT AGT AG A TAGTATACAC AATAGAGTAT ACCCTGAAAC 1050 0 

GGTTGCGAAG TACGCTAATC ACTTTGCTAC TGATCTAGAT AGTTTCTTTA ATCAATAAAC 10560 

ACAGCATCCA CAGATTGACT TAGGATATTG TAAGTTTTTT GAAAGCTAGA GAGAAGGTCT 10620 

CTAAAATTAA AAAACGCATA GTATAGGATG TTGAAATGAT GAACTGCACC CCAAAAGTTA 10 680 

GACAGAAAAA AATCTAACTT TTGGGGTGTT TTTATTATGA AATTAACTTA TGATGATAAA 10740 

GTTCAGTTCT ATGAACTTAG AAAACAAGGA TATATCTTAG AGAAGCTTTC AAATAAATTT 10800 

GGGATAAATA ATTCTAATCT TAGGTACATG ATTAAATTGA TTGATCGTTA CGGAATAGAG 108 60 

TTCGTCAAAA AAGGGAAAAA TCGTTACTAT TCTCCTGATT TAAAACAAGA AATGATTCAT 1092 0 

AAAGTCTGAC ATGAAGGCTG GACTAAAGAT AGAGTTTCTC TTGAATACGG TCTCCCAAGT 10980 

CGTACGATAC TTCTTAACTG GCTAGCACAA TACAGGAAAA ACGGGTATAC TATTGTTGAG 11040 

AAAACAAAAG GG AGAGT AC C TGAGAGCGGA GAATGCCATC CTAAAAAAGT TAAGAGAACT 11100 

CCGATTGAAG GAGGAAAAAG AGAAATAAGA AAGACAGAAA TTGTTCAAGA ATTAATGACT 11160 

GAGTTTTCGT TAGATCTTCT TCTAAAAGCC ATTAAACTAG CTCGTTGGAC C T AC T AC TAT 112 2 0 

CACTTGAAAC AGCTAGATAA AC C AG AT AAG GACCAAGAGC TTAAAGCTGA AATTCAATCC 112 80 

ATCTTTATCG AACACAAGGG AGATTATGCT TATCGCCGGG TTCATTTAGA ACTAAGAAAT 113 4 0 

CGTGCTTATC TGGTAAATCA TAAAAGAGTT CAAGGCTTGA TGAAAGTACT CAATTTACAA 11400 

GCTAGAATGC GACAGnAACG AAAATATTCT TCTCATAAAG GAG 11443 

(2) INFORMATION FOR SEQ ID NO : 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5338 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

CCAATTACAT TAT ATT AT C A AAATCGTCGA AACTGGCTCC ATGAATGAGG CAGCCAAGCA 60 

ACTCTTTATC ACTCAGCCAA GTCTCTCCAA TGCAGTGCGA GATTTGGAAA ATGAAATGGG 12 0 

CATTGAGATC TTTATCCGCA ATCCCAAGGG AATCACCTTG ACCCGTGATG GCATGGAGTT 18 0 

TCTCTCTTAT GCCCGTCAGG TTGTCGAGCA GACCCAGCTT CTGGAGGAAC GCTATAAAAA 2 40 

TCCTGTCGCC CACCGCGAAC TCTTTAGCGT TTCGTCTCAA CACTATGCCT TTGTGGTCAA 300 

TGCCTTTGTC TCTTTGCTCA AGAAAAGCGA TAT GG AG AAA TACGAACTCT TCCTTCGTGA 3 60 

AACTCGGACT TGGGAGATTA T C G AC G AC GT CAAGAACTTC CGCAGTGAGG TCGGGGTCCT 42 0 

CTTCTTAAAC AGTTACAACC GTGATGTTTT AACCAAGATG CTGGATGACA ATCACCTGCT 4 80 

AGCCCACCAT CTCTTCACAG CGCAACCGCA TATCTTTGTC AGCAAGACCA ACCCTCTGGC 54 0 

AAAGAAAGAC AAGGTGAAAC TGTCTGATTT GGAGAATTTC CCTTACCTCA GCTATGACCA 6 00 

AGGG AC GC AC AACTCCTTCT ACTTTTCAGA AGAGATTCTT TCTCAAGAAC ACCACAAGAA 660 

ATCCATTGTG GTCAGTGACC GTGCCACCCT CTTTAATCTC TTGATTGGTT TGGATGGTTA 72 0 

TACCATTGCG AC AGGG AT TT TGAACAGCAA CCTAAACGGA GACAATATCG TTTCTATCCC 7 80 

ACTGGATATT GATGACCCGA TCGAGCTGGT CTATATCCAG CAT G AG AAAA CCAGCCTATC 84 0 

TAAGATGGGC GAACGCTTTA TAGACTATCT CCTAGAAGAA GTTCAGTTTG AT AG T T GAGA 900 

AATGATAAGA ACCAATATGT AGGCTAGCAA CAACCTGCAC ATTGGTTCTT TTT ACT TATA 9 60 

ATTAAAAGTT TCCCCTGCCA ACTTATCAGC TAGCTTGGGA AAGAGAGTAT AAAACTTATG 102 0 

GGCTAGGTTC AACAAAATCG GG AG AT TGAG TTCTCGTTTG TTTTTTCCTA TAATCTTGAC 1080 

AATCTTTTTA GCCACTGCAT CTGGTTCTAG CAGGAAGCGA TCAACCGATT TAAGATAAGT 114 0 

TCCATCTGGG TCGGCTTGGT CGAAAAATCC TGTACGGATT GGTCCTGGAT TGACTGTTGT 12 00 

CACATAGACT CCATAGGGCA TAAGTTCGAG TCGCAGAGCA TTTGAAAAAC CAATAGCCGC 12 60 

AAACTTGGTC GCTGAGTAAA G ACT AG ACT T GCCAGTAGCT ATTAGACCTG CCATGCTGAC 13 2 0 

GATGTTGATG ATATGCCCTT TGCTGCTTTC CTTCATACGA GCCGCAAGGT GACGAGACAG 13 80 

ATTCATCAGG GCAAAGGTAT TG AC CT C AAA CATCTGGTGA ATATCTTTAT CAGCAATCTG 144 0 

GTCAAATCCC TCAAAAATCC CGTAACCAGC GTTGTTAATC AAGACATCAA TCTTGCCATA 1500 

GCGGAGATAA AGATCAGTTA CCAGAGCTTC TAGGGCTGAA TCGTCGGTAA TATCAATTTC 1560 
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AATCAATTCT GCATGGGAAT AATTTCCGTA GAGTTGGGCT AATTTTTCCT TATTTCTACC 162 0 

AAGCAAGATG AGTTGGTCAT TGGGCAGGAG TTTGACCATT TCTTGAGCTA GACCACCGCT 1680 

AGCTCCGGTA AT G AG AAT AG TAGGCATACT TATCCTTTCT GTGACTGCTA GATTTCCACT 174 0 

TCTTCCAAGT CTTTGACCAC ATGGACATTT TCAAAAATTG TGGCAGCGTC TTTCTTGAGT 18 00 

T TG C T AAT AT CTTTTGAGAG GAAACGGGCA CTGATATGGT TGAGTAGGAG GCGTTTGGCA 18 60 

CCTGCTTCTA CCGCTACTTG TGCAGCTTGC ATATTAGTTG AGTGACCATG GTTACGAGCA 192 0 

ATTTTTTCAT CACCCTTGCC ATAAGTGGAC TCATGAACTA GGACATCTGC ATTGACAGCC 19 80 

AGACGCACAC TGGCACCCGT TTTTCGAGTG TCTCCTAAAA TAGTGATAAT CTTACCTGGA 2040 

CGTGGCGCTG AG AT AT AGT C TGCTGCCTTG ATTTCAGTTC CGTCTTCCAA AACAAGATCC 2100 

TGGCCGTTTT TG AT T T T AC C AAAAAGCGGG CCGAACGGAA CACCAGCAGC CTTGAGTTTT 2160 

TCAGCATCCA GCGTCCCTTC TAG AT C C T TT TGCATGACAC GATAGCCAAC ACAGAAAATA 222 0 

GTGTGGTCCA GCTCCTCTGC ATACACAGTG AATTTATCGG TTTCAAGAAT TTTACCCAGA 22 80 

GAATCTTGGT CAAACTCATG GAAAT G AATG CGGTAGGGCA GACGAGAACC T G AC AC ACG A 23 4 0 

AGGCTGGTTA AGACAAATGA CTTGATTCCT TGAGGTCCGT AGATTTCCAA ATCTGTCTGC 24 00 

TCTTCATTGG CCTGAAAGGC ACGGCTAGAA AGGAAACCTG GCAAACCAAA AATGTGGTCT 24 60 

CCATGCAGAT GGGTAATAAA GATTTTGCTG ACCTTACGTG GTCGAATTGT GGTTTCCAGA 2 52 0 

ATGCGATTTT GCGTACCTTC TCCACAGTCA AAG AG C C AAA CTTCGTTAAT CTCATCCAAA 2580 

AGTTTCAGGG C GAG AC T TG A AACGTTGCGG GCTTTAGAGG G C TG AC C AG C CCCCGTTCCT 2 640 

AAAAATTGAA TATCCATTCG ATACTTTCTA ATTAATCAAT ATATAACATG GCTGTGCGGT 2 700 

TTTCCGATCG GAAAT AGCGT T T GC C AG AAA AAGCAGCAGC TTCTTGCAAT AAATCCTCTT 27 60 

GGCTGTAGCC TTTGAGACGT TTTCGACCAT CAGCCAATCT TTCCAAATCA GTCAAAGCTG 2 82 0 

T G AG ACT TT C TAGGCTGATA ACTTCCTCGT CCTCGACAGG CTTCATGTAA ATCTTACCAG 2 88 0 

ACTCTTCAAA GACTAATTGA TGGGGGAAAA TTTGCGCAAT TTCAAAGAGC AAGTCATCCG 2 94 0 

AGATTTTCTC CTCATTTTCA AAGAAAATCC GACCAAGGCC GTCACTCTCA TAACAAAAAC 3000 

CAAAGGATTT ACCAGACAGA TT AAG C CG AA TAAAAGGCTT ATTTTCTAGG GTGAAACTTG 3 0 60 

GCTCAGTATT GTAAAGATTC AGTTCCTGAC TGAGTTCTGC AAAATAATCC GTCGCAGCCT 3120 

GAGGACTCTT TTTCTGATAG AGTTCTGCAA AGTAGGCATT AACAACACTT GGCGGAGGTG 3180 

TAATAAGTGT TAACTGCTCC TGATCTGTTT TACCAGCTAG AAGCTGATCC AGATAGACCT 32 4 0 

TGTCCAGACT TGTATAACCT C CAT ACT T T A GAGCCAAAGT TTT AAT AT C A GTCATAAAAT 3300 
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TCTTCTAACC 


TCCATTTATT 


TTTCTCGGAA 


ATGTAGCCTG 


TAATCACTTC 


GCCGTCTTCC 


3360 


TGATAATCAC 


GTTCTTCCAG 


AATTGCAACA 


CTCTCTAAAT 


CATGAATCTT 


GTAGGACTTT 


3420 


GAAAAAGGCA 


CTCGCAGGGT 


AAATGCTTCA 


AAAATTTCCT 


TAATCTTATC 


TAG C AAT AAT 


3480 


GCTTGCAAGT 


TTTCACGACT 


GTCCTCAGAC 


TTGGCAGAAA 


TGAGGGTATA 


TGGCGTTTGG 


3540 


GTAGGCGTGA 


AATCCTCCAC 


CAAATCCGCT 


T T ATT AT AAA 


GCGTCAAGTG 


AGGAATATCT 


3600 


TCCATGTCCA 


GGTCTTTCAT 


GATGGAGAGA 


ACCGTTTTTT 


CATGCTCCTC 


GTGGTAAGGA 


3660 


TTGCTAGCAT 


CGATAACATG 


AACCAGAAGG 


TCCACATGCT 


TGCTTTCTTC 


CAAGGTTGAC 


3720 


TTGAAACTGG 


ACACCAACTC 


TGTCGGCAAA 


TCTTGGATAA 


AGCCAACGGT 


ATCTGTCAAA 


3780 


GTTACTTGGA 


GATTGCCTCC 


CAGATGAATA 


CTCTTGGTTG 


TCGCATCCAG 


AGTCGCAAAG 


3840 


AGCTCATCTG 


CTTCATACTG 


GGTCTTACTG 


GTCAAGATGT 


TCATGATAGT 


TGATTTCCCA 


3900 


GCATTAGTAT 


AACCAATCAA 


ACCAATCTTA 


AAAGTG CT AG 


ACTCCAAACG 


TTTTTCTCTG 


3960 


ACAGTCGCAC 


GATTTTTCTC 


AACCACCTTG 


AGCTGGCGCT 


CG AT AT C C G T 


GATTTGATTG 


4020 


CGAACGCTAC 


GACGGTTCAG 


CTCCAGCTGG 


CTTTCACCAG 


GACCACGGGA 


ACCAATTCCC 


4080 


CCTgCCTGAC 


GGCTGAGCAT 


AATCCCCTGA 


CCAACCAAGC 


GAGGCAAAAG 


GTATTTGAGT 


4140 


TGGGCTAGGT 


GG ACT TGGAG 


CTTCCCTTCA 


TGGCTTCGAG 


CCCGCATGGC 


AAAG AT AT C C 


4200 


AAAATCAACT 


GCATACGGTC 


AATGACCTTA 


AC AC CG AG AA 


CTTCCTCTAG 


ATTGACATTC 


4260 


TGCCTTGGGG 


TCAGACGATT 


GTTGACGATG 


ACAGTAGTGA 


TTTCTTCTGC 


AT C C AC CAT A 


4320 


AGCGCAATCT 


CTTCCAACTT 


ACC AG AG C CG 


ACGAAGGTCT 


TGG AAT CAT A 


TTTTTCACGT 


4380 


TTTTGTCTGT 


AGCTATCTAC 


AACGACTGCC 


CCTGCCGTTT 


TCGCTAAACT 


AGCCAATTCT 


4440 


TCCATGGAGA 


GGTCAAAACT 


GTCCATACCC 


TGCAATTCCA 


CACCAATCAG 


CAGGACTCGC 


4500 


TCCTCTTTTT 


TCTCCGTTTC 


AATCATCTAA 


AAACTCCTCT 


ATCTGGCTTA 


AAATGCGGTC 


4560 


TTGT AC AC C A 


GATTCTCCAA 


TCTGATAAAA 


GGTGACCTGC 


AT G CG ATT AC 


GGAACCAGGT 


4620 


CAGCTGACGC 


TTGGCAAAAC 


GACGAGTCGC 


CTGTTTAAGA 


CTCTCACTAG 


CTTCCTCCAA 


4680 


GGTCTGCTCT 


CCACGGAAAT 


AAGGAAAGAG 


TTCCTTATAG 


CCAATTCCTT 


TAGCAGCCTG 


4740 


TACATTAGGG 


GAATGGTCAA 


ACAGCCACTT 


GGCCTCATCC 


AAAAGC C C AG 


CCTCAAACAT 


4800 


CAAATCCACT 


CGGTGGTTGA 


TACGCTCATA 


AAGTTGACTA 


CGTTCATCAT 


CCAAGCAGAT 


4860 


AATCAGCGGT 


TCATACAAGG 


TCTCTTGATT 


TTCCAAATCC 


TGACCAAAAT 


GGGCAATTTC 


4920 


TAAGGCACGC 


ATAGCACGAC 


G AC G ATT AAA 


CTGGGGAATC 


TCAAGGCCTG 


CTTGATCCAC 


4980 


CAAATGGGCT 


AATTCCTCAT 


CTGAAT AT GG 


CTCCAAACTA 


GCTCGATAAG 


CTAAAATCTC 


5040 


CTCATGAGGA 


GTCTCCCCAC 


CTAGGTGGTA 


ACCTTCTAGC 


AAGCTCTGGA 


TATAAAGTCC 


5100 
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AGTCCCACCG GCGATAATGG CTAGCTTGCC ACGGTTGTGA ATACCCTCAA TAGTCATCTT 5160 

AGCTTCTGAA ACAAAATCAA AAGCCGAGTA AGACTCGGTT ATCTCTCTAA CATCGATTAA 5220 

AT GAT G AGG A ACAGCTGCCT GCTCTTCTGG ACTAGCCTTG GCCGTCCCAA TATCAAGTCC 5280 

T CG AT AG ACT TGCTGGCTAT CTCCACTAAC CACTTCGCCA TTAAAACGCT TTGCGGGG 53 3 8 



(2) INFORMATION FOR SEQ ID NO : 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19446 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

CGGAAACCCA TCTAGTCTCC ATCGTTTGGG AGACCAAGCA ACACGAATCT TAGATGCTTC 60 

TCGCCAACAG ATT GC AG ATT TAATCGGTAA GAAAAGCGAT GAAATCTTCT TTACCTCGGG 12 0 

TGGAACAGAA GGGGATAACT GGCTTATCAA GGGTGTGGCC TTTGAAAAAG CTCAGTTTGG 180 

C AAGC AC AT C ATTGTTTCAG CCATTGAACA TCCAGCAGTC AAAGAGTCAG CCCTCTGGTT 240 

GAAAAGTCAA GGATTTGAAG TGGATTTTGC TCCAGTTGAT AAGAAAGGCT TGGTCGATGT 3 00 

TGAGGCGTTA CAGGTTTGAT ACGGCATGAT ACAATCCTCG TTTCCATCAT GGCTGTGAAC 3 60 

AATGAAATCG GCTCTATCCA ACCTATTGAG GCTATTTCAG AATTCTTGGC AGACAAGCCG 42 0 

ACTATTTCCT TCCACGTTGA TGCGGTTCAG GCGCTTGCCA AAATTCCGAC TGAAAAGTAT 4 80 

CTGACAGAAC GGGTGGATTG CGCGACTTTC TCTAGTCACA AGTTCCACGG GGTTCGAGGT 54 0 

GTTGGCTTTG TCTATATCAA ATCTGGCAAG AAGATTACAC CTCTTCTTAC AGGTGGTGGC 600 

CAGGAGCGAG ATTATCGTTC GACAACTGAA AATGTGGCAG GGATTGCAGC GACAGCCAAG 660 

GCCCTCCGTT TGTCTATGGA AAAGCTAGAT ATCTTTAGGA GCAAGACTGG GCAGATGAAG 72 0 

GCAGTGATTC GCCAAGCTCT TCTGAACTAT CCGGATATTT TTGTCTTTTC AGATGAGGAA 7 80 

AACTTTGCAC CTCATATTCT G AC TTTTGG A ATCAAAGGTG TTCGAGGTGA AGTCATCGTT 84 0 

CACGCCTTTG AAGACTATGA TATTTTCATC TCAACAACCT CAGCTTGTTC AT C T AAGGC A 900 

GG AAAAC C AG CCGGTACCTT GATTGCCATG GGAGTGGACA AAGAT AAGGC CAAGTCAGCT 9 60 

GTGCGTCTTA GCCTAGACTT GG AAAAT GAT ATGAGTCAGG TCGAGCAGTT TTTGACCAAG 1020 

TTAAAATTGA TTTACAATCA AACTAGAAAA GTAAGATAGG AGCATTCATG CAGTATTCAG 1080 

AAATTATGAT TCGCTACGGA GAGTTGTCAA CCAAGGGTAA AAACCGTATG CGTTTCATCA 1140 
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ATAAACTTCG TAATAATATT TCGGACGTTT TGTCTATCTA TACCCAAGTT AAGGTAACAG 12 00 

CAGATCGCGA CCGTGCCCAC GCTTACCTCA ATGGAGCTGA TTACACAGCA GTTGCAGAAT 12 60 

CTCTCAAACA AGTTTTTGGA ATTCAAAACT TTTCTCCTGT TTATAAGGTT GAAAAATCTG 132 0 

TAGAAGTTTT GAAGTCTTCT GTCCAAGAGA TTATGCGGGA CAT C T AC AAG GAAGGTATGA 13 8 0 

CCTTTAAGAT TTCTAGCAAG CGTAGCGACC ACAACTTTGA ACTTGATAGT CGTGAACTCA 1440 

ACCAAACACT TGGAGGGGCT GT ATT C G AAG CCATTCCAAA TGTGCAAGTT CAAATGAAAA 15 00 

GTCCTGACAT CAATCTTCAG GTGGAGATTC GTGAAGAAGC AGCCTATCTT TCTTATGAAA 1560 

CCATTCGTGG GGCTGGTGGT TTGCCAGTTG GAACTTCAGG TAAAGGGATG CTCATGTTGT 162 0 

CAGGAGGGAT TGACTCACCT GTAGCAGGTT ATCTTGCTCT TAAGCGTGGG GTGG AT AT C G 168 0 

AGGCAGTTCA CTTTGCTAGT CCACCATATA CTAGTCCTGG TGCCCTCAAG AAAGCGCAGG 1740 

ACTTGACCCG T AAATTG AC C AAGTTTGGCG GAAATATCCA GTTTATAGAG GTGCCTTTCA 1800 

C AG AG ATT C A AGAGGAAATC AAAGCCAAAG CGCC AG AAG C TTATTTGATG ACTCTAACTC 18 60 

GTCGCTTTAT GATGCGGATT ACTGACCGTA TTCGTGAGGT ACGAAATGGT TTGGTTATCA 192 0 

TCAATGGGGA AAGTCTAGGT CAAGTAGCCA GCCAAACCCT TGAAAGTATG AAGGCTATCA 1980 

ATGCTGTTAC CAACACTCCC ATCATTCGTC CTGTGGTTAC C AT GG AC AAG TTGGAAATCA 2 040 

TTGACATCGC CCAGGAAATC GATACCTTTG AC AT TTC AAT CCAACCGTTT GAAGACTGTT 2100 

GTACCATTTT TGCACCAGAT CGTCCAAAAA CAAATCCTAA AATTAAGAAT GCGGAGCAGT 2160 

ACGAAGCGCG TATGGATGTT GAAGGCTTGG TTGAGCGAGC AGTGGCTGGA AT C ATG AT T A 2 22 0 

CTGAAATCAC ACCTCAAGCC GAAAAAGATG AAGTTGATGA CTTGATTGAC AATCTGCTCT 2 2 80 

AATTCAGAAA AT C C AAAAGA AT AG CG AAAA TCAGTAAAAA AAGTTAGTTT TTTCTCTAAA 2 3 40 

AACAGGTAAA AAACTAACTT TTTTTATTTT TATGATATAA TGATATAAAA TTTTGAATAT 2400 

AGAGAGTTTT CTGACAATGA ATCAATCCTA CTTTTATCTA AAAATGAAAG AACACAAACT 246 0 

CAAGGTTCCT TATACAGGTA AGGAGCGCCG TGTACGTATT CTTCTTCCTA AAG AT T ATG A 2 52 0 

GAAAGATACA GACCGTTCCT ATCCTGTTGT ATACTTTCAT GACGGGCAAA ATGTTTTTAA 2 580 

TAGCAAAGAG TCTTTCATTG GACATTCATG GAAGATTATC CCAGCTATCA AACGAAATCC 2 640 

GGATATCAGT CGCATGATTG TCGTTGCTAT TGACAATGAT GGT ATGGGG C GGATGAATGA 2 7 00 

GTATGCGGCT TGGAAGTTCC AAGAATCTCC TATCCCAGGG CAGCAGTTTG GTGGTAAGGG 2 7 60 

TGTGGAGTAT GCTGAGTTTG TCATGGAGGT GGTCAAGCCT TTTATCGATG AGACCTATCG 2820 

T AC AAAAG C A GACTGCCAGC ATACGGCTAT GATTGGTTCC T C ACT AGG AG GCAATATTAC 2880 

CCAGTTTATC GGTTTGGAAT ACCAAGACCA AATTGGTTGC TTGGGCGTTT TTTCATCTGC 2 940 
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AAACTGGCTC CACCAAGAAG CCTTTAACCG CTATTTCGAG TGCCAGAAAC TATCGCCTGA 3 000 

CCAGCGCATC TT CATC TAT G TAGGAACAGA AGAAGCAGAT GATACAGACA AGACCTTGAT 3 0 60 

GGATGGCAAT ATCAAACAAG CCTATATCGA CTCGTCGCTT TGCTATTACC AT G ATTTG AT 312 0 

AGCAGGGGGA GTACATCTGG ATAATCTTGT GCTAAAAGTT CAGTCTGGTG C CAT C CAT AG 318 0 

TGAAATCCCT TGGTCAGAAA ATCTACCAGA TTGTCTGAGA TTTTTTGCAG AAAAATGGTA 3 2 40 

AGTTAAGAAA GGAAAAAACG AAATGCATAT TGAACATCTT AGCCACTGGA GTGGTCATCT 3 300 

TAACCGTGAA ATGTACCTTA ACCGTTATGG ACATGGTGGG ATTCCAGTTG TGGTCTTTGC 3 3 60 

TTCATCAGGT GGTAGTCACA ACGAATACTA TGATTTTGGC ATGATTGATG CCTGTGCTTC 342 0 

CTTTATCGAG GAAGGCCTTG TCCAGTTCTT TACCCTATCT AGTTTGGATA GTGAGAGCTG 3480 

GTTGGCTACT TGGAAAAATG CTCATGACCA AGCGGAAATG CACCGTGCCT ACGAACGTTA 3 54 0 

TGTGATTGAG GAGGCCATTC TTTTATCAAG CACAAGACAG GTTGGTTTGA TGGCATGATG 3 600 

ACGACAGGTT GCTCTATGGG AG C C T AT CAT GCACTCAATT TCTTCCTCCA G CAT C C AG AT 3 6 60 

GTCTTTACCA AAGTGATTGC TCTCAGTGGT GTTTACGACG CACGTTTCTT TGTCGGTGAT 372 0 

TACTACAACG ATGATGCTAT TTACCAAAAC TCGCCAGTAG ATTAT ATTTG GAACCAAAAC 3 7 80 

GACGGCTGGT TTATTGACCG TTACCGTCAG GCAGAGATTG TGCTGTGTAC GGGGCTTGGA 3 840 

GCCTGGGAAC AAGATGGTTT GCCATCCTTT TACAAGCTCA AAGAAGCCTT TGACAAGAAA 3 9 00 

CAAATTCCAG CCTGGTTTGC TGAATGGGGA CATGATGTCG CCCATGACTG GGAATGGTGG 3 9 60 

CGTAAACAAA TGCCTTATTT CCTCGGTAAT C T CT ATT T AT AAAAGGAGTT ACCTATGAAT 402 0 

TACCTTGTTA TTTCTCCCTA CTATCCACAA AACTTTCAAC AGTTTACCAT CGAACTAGCT 4080 

AATAAAGGCA TCACAGTCTT GGGAATTGGT CAAGAGTCTT ACGAGCAATT GGATGAGCCC 414 0 

TTGCGCAATA GCTTGACCGA GTATTTTCGT GTTGATAATC TTGAGAACAT AGATGAAGTC 4200 

AAACGTGCAG TTGCTTTTCT CTTTTATAAA CATGGTCCAA TTGGCCGCAT CGAGTCTCAC 42 6 0 

AATGAATACT GGCTTGAGCT AGACGCAACA CTCAGAGAAC AATTCAATGT TTTTGGTGCC 432 0 

AAACCAGAGG ATCTCAAAAA GACGAAATAT AAGTCTGAAA TGAAGAAACT TTTCAAAAAA 4380 

GCAGGTGTTC CTGTGGTACC TGGAGCTGTT ATCAAGACGG AAGCAGATGT TGATCAAGCA 444 0 

GTGAAAGAAA TCGGTCTTCC AATGATTGCC AAACCTGATA ATGGAGTGGG AGCAGCCGCA 4500 

ACCTTTAAAC TTGAGACAGA AGACGATATC AATCACTTCA AGCAAGAATG GGACCATTCA 45 60 

ACCCTTTATT TCTTTGAAAA ATTTGTCACT TCCAGCGAAA TCTGTACCTT TGACGGGCTC 462 0 

GTGGACAAGG AT GG AAAGAT TGTCTTCTCA ACAACCTTTG ACT AC G C CT A TACACCGCTT 4 680 
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GACCTCATGA TTTATAAGAT GGACAATTCT TATTATGTGC TCAAGGATAT GGATCCTAAA 474 0 

CTGCGCAAGT ATGGGGAAGC AATTGTCAAA GAATTTGGTA TGAAAGAACG GTTTTTCCAT 4 800 

ATTGAGTTCT TCCGTGAGGG GGACGATTAT ATT AC CAT C G AGTACAATAA CCGCCCTGCA 4860 

GGTGGTTTTA CCATTGATGT TTATAACTTT GCTCATTCCT TGGACCTTTA TCGTGGCTAT 4920 

GCAGCTATTG TCGCAGGAGA GGAGTTCCCG GCGTCAGACT TTGAAACTCA GTATTGTTTG 4 980 

GCTACTTCTC GCCGTGCAAA TGCTCACTAT GTTTATTCAG AAGAGGATTT GCTTGCCAAA 5 04 0 

TATAGCCAGC AGTTCAAGGT TAAAAAAGTC ATGCCAGCTG CCTTCGCGGA ACTTCAAGGA 5100 

GATTACCTGT ATATGCTGAC CACTCCGAGT CGACAAGAAA TGGAGCAGAT GATTGCAGAT 5160 

TTCGGACAAC GTCAAGAATA AGAACTATCG GATTAAGGAA ATTAACTCCC TTAATCCTTT 52 2 0 

TGTTTTGTCT GATAAAAAAT AAGAGCATCC CAACAAGGTA GCTATCATAA AACTTGTTCG 52 80 

ATAACTATTT GAAGCAGGAT T AGGTGGT C A GAAATTAAAT TTTAATATTT CAATTGAGTC 5 34 0 

ATAGTATTGT GTTTGCGTAT CCTTAAATCA GCTAAAAGGA TCCATGACGA CACCTATACG 5400 

ATATAGTTTT CAAGATACCA AACAAGTCTA TTAATATTCA ATGAAAATCA AAGAGCAAAC 54 60 

TAGGAAGCTA GCCGCAGGTT TCTCAAAACA CTGTTTTGAG GTTGTGGATA GAACTGACAG 552 0 

AGTCAGTATC ATATACTACG GCAAGGTGAA GCTGACGTGG TTTGAAGAGA TTTTCGAAGA 558 0 

GTATAAAATA TTCAGGTGAC GCATAGATAT AGTTAATTGA AGCTTTGTTT GAAATCTGAT 5 640 

AAAATAATGA TATTACTAAG TTTTAAAAAC TAAAGAAAAG GGAAGATATG ATTACAGGCG 57 00 

AATTAAAAAA TAAAATCGAT CAGCTGTGGG AAATTCTTTG GACAGAAGGA AACGCAAATC 57 60 

CTTTAACAAA TATTGAACAG TTGACTTATC TCTTATTTAT GAAAGATTTG GATAGTGTCG 582 0 

AGCTTGGACG T G AAAGTG AT GCTGAATTTC TAGGGATTCC TTATGAGGGA GTTTTTCCAA 5 880 

AAGATAAACC TGAATACCGT TGGTCAACTT TTAAAAATAT AGGAGATGCT CAGGAAGTTT 5 940 

ATCGTTTAAT GACTCAGGAG ATTTTTCCGT TTATTAAAAA TCTCAAGGGG GATACAGATG 6 000 

ATACAGCCTT TTCACGATAT ATGCGAGAAG CTATTTTTCA AATAAATAAA CCTGCTACGC 6060 

TTCAAAAGGC AATTTCTATC TTAGATGTTT TTCCAACTAG GGGATTAGAT GTAGATTTTG 6120 

ATAATGACAA ACAAAGTATT ACTGATATCG GAGATATCTA TGAATATCTG TTATCAAAAT 6180 

TGTCGACCGC AGGTAAAAAT GGACAGTTCC GTACACCTCG TCACATCATC GATATGATGG 6240 

TTGAGTTGAT GC AAC CG ACT AT C AAAG AT A TCATCTCAGA TCCCGCTATG GGTTCTGCTG 63 00 

GCTTCTTAGT ATCTGCTAGC CGTTACTTAA AGCGTAAGAA AGATGAATGG GAAACCAATA 63 60 

CAGATAATAT CAATCATTTT CATAATCAGA TGTTTCATGG AAAT GAT AC G GATACGACTA 6420 

TGTTGAGACT TGGGGCGATG AACATGATGC TACATGGAGT AGAAAATCCA CAAATCAGTT 64 80 
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ACCTTGACTC GCTGTCTCAA GATAATGAAG AAGCCGATAA AT AT AC T T TG GTTTTAGCAA 654 0 

ATCCTCCTTT TAAGGGCTCA CTTGACTACA ATTCAACCTC TAATGACCTT CTTGCAACCG 6600 

TAAAAACCAA AAAAACAGAA TTACTCTTTC TTTCTCTTTT CTTGCGAACT TTAAAACCAG 6660 

GTGGACGAGC AGCAGTTATC GTACCTGATG GTGTCCTTTT TGGTTCGTCT AAAGCTCATA 672 0 

AAGGAATTCG TCAGGAAATT GTAGAGAATC ATAAGCTTGA TGCTGTAATC TCAATGCCTA 67 80 

GTGGTGTGTT C AAGCCT T AT GCTGGAGTTT CAACTGCCAT TCTCATCTTT ACAAAAACTG 6840 

GTAATGGTGG TACTGACAAA GTCTGGTTTT ACGATATGAA AGCGGATGGT TTAAGTTTGG 6 900 

ATGATAAGCG ACAACCGATT AGCGACAATG ATATTCCAGA TATTATCGAA CGCTTTCATC 6960 

ATCTTGAAAA AGAAGCAGAA CGTCAGAGAA CGGATCAATC TTTCTTTGTT CCAGTTGCTG 7 02 0 

AGATAAAGGA AAATGATTAT GATTTGTCTA TCAATAAATA TAAAGAGATT GAGTATGAAA 70 8 0 

AAGTTGAGTA TGAACCAACA G AAGT C AT AT TAAAGAAAAT CAATGATTTA GAAAAAGAAA 714 0 

TTCAAGCTGG CTTGGCTGAA TTGGAAAAAT TACTCAAGTA GGGAGGTGGC TGTATGAAAA 72 00 

AAGTG AAGT T GGGGGAAGTC TTATCTCTAA AAAAAGGCAA GAAAGCCACT GTACTTGCTG 72 60 

AACAAACAAC TCTAAGCCAA CGTTATATTC AAATAGATGA TTTAAGAAAT AATAATAATT 732 0 

TAAAATTCAC TGAAAGTTTA AATATGACTG AAGCACTCCC AGATGATATT CTGAT AG CAT 7 3 80 

GGGATGGAGC TAATGCAGGA ACAGTTGGTT ATGGATTATC GGGAGCTGTT GGTAGTACAA 744 0 

TTACGGTCTT AAAAAAGAAT GAGCGATACA AAGAAAAAAT TATATCAGAT TACTTGGGAG 7 500 

TCTTTTTGGA AAGTAAATCG CAGTATTTAC GAGATCATTC AACAGGTGCA ACAATTCCTC 7560 

ATTTAAACAA GAATATATTA CTTGATTTAC AATTAGAATT GCTAGGTATC GAAGAACAAG 7 62 0 

AGAACATTAT CTGTATTCTT AATACGATTA AAAGGCTTAT TACTAAAAGA AAATTTCAGT 7 680 

TAGATGAACT AAACTTGCTC GTCAAATCCC GATTTAACGA GATGTTTGGG GAAAATAAAA 774 0 

TATTTGAAAG CATTGATAAC TTATTTGATA TTATAGATGG TGATAGGGGC AAAAAT TAT C 7 800 

CTAAATCAGA TGAGTTGTTT AGTGAGGAGT ACTGTTTATT TTTAAATACA AAGAATGTTA 7860 

CTAAAAACGG ATTTTCATTC GATACAAAGC AATTTATCAC TAAAACAAAG GATAAATTAC 792 0 

TTCGAAAAGG CAAACTTGAG CGTTATGATA TAGTCTTGAC AACAAGAGGT ACTGTTGGAA 7 980 

ATGTAGCGTA CTACGATGAA TTAATAAAAT ATAAACATTT ACGTATAAAT TCAGGTATGG 804 0 

TAATATTACG TCCCAAGACA CCAAATCTAA AT C AG AAATT TATTATCCAT GTTTTAAGGA 8100 

ATAATAATTA TAGTCGAGTG ATATCAGGAA GTGCTCAGCC TCAGTTACCA ATTACAAAAT 8160 

TAAAAAAAAT ACTTCTCCCC CTCCCCCCAC TAGCCCTCCA AAATGAGTTC GCAGACTTTG 822 0 
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TAGTCCAGGT 


CGACAAATCA 


CAATTTGCTT 


GTGAGATAGC 


TATAAAAGTG 


TG GAG AAAT A 


8280 


GCTTGAAATT 


TAGTATAATA 


TAGCTAAACT 


ATTTGTTTAA 


AGTGAGAAAA 


AAATGGGAAA 


8340 


TTTTAGCTTT 


CTTTTAAAAA 


ATGACGAATA 


TGAATCTTTT 


TCAAAACCTT 


GCATTGAAGC 


8400 


TGAGAATATG 


ATTGCTACAT 


CAACTGTGGC 


TACTGCCTTT 


ATGGCGCGTC 


GTGCTTTAGA 


8460 


GCAGGCTGTC 


CATTGGATAT 


ATAGTCACGA 


TTCATATTTA 


GAAGCTCCCT 


ATCGTGCTAC 


8520 


TCTATCTTCT 


TTAGTATGGG 


ATGATGATTT 


T AGGG AT AT C 


GTAGATTCTG 


AACTCCACAA 


8580 


GCAGATAGTT 


CTGTTGATTC 


GGTGGGGAAA 


CCATGCTGCT 


CATGGTGGTG 


AAATTAAGGA 


8640 


ACGAGAAGCG 


ATTTTAGCTT 


TGCATCATTT 


GTATCAGTTT 


GTTAATTTTA 


TCG ATT ATTG 


8700 


TTACAGCAAT 


GAGTTTGTGG 


AGCGTTATTT 


TGATGAGAAG 


TGCTTACCAC 


TTTCAGCAAA 


8760 


CAT C AAAT AC 


CGAGAAACTC 


CACAATCTAT 


GATAAAGTTA 


CAAGACAGTT 


TACCAGAACT 


8820 


GCCTGATTTT 


CATGAACAGA 


TGGCTGCTCA 


GTCCGTAGAA 


GTTCAAGAGA 


CTTATACTGA 


8880 


AAAACGT G AG 


ACTGCAGCGC 


AACGGCAAGA 


TGTGCCTTTC 


CATATTGATC 


AATTATCTGA 


8940 


GGCAGAGACA 


AGAAAGCTCT 


T T ATTG AT AT 


CGATCTCCGT 


TTAGCAGGAT 


GGATATTTGA 


9000 


AGAAAACTGT 


CGTGTTGAGA 


TAGCCGTTGA 


TGGTCTCAAG 


CACGGTTCAG 


GAATTGGTTA 


9060 


CTGTG AC TAT 


GTACTTTATG 


GTAAAAATGG 


GAAAATTTTA 


GCGATTGTGG 


AGGCTAAAAA 


9120 


AGCCTCTGTC 


AATCCAGAAG 


TAGGGGAAGT 


ACAGGTCAAA 


GAATATGCTG 


AAGCTTTGGA 


9180 


GAAAC AT AT C 


GGCTATCAGC 


CAATTTGCTT 


TAT T AC AAAT 


GGGTTGAAGC 


ACTATATACT 


9240 


TGATGGTCCG 


AACCGCCGCC 


AGATTGCAGG 


CTTTTACTCT 


CAAGAAGAAT 


TGCAATTAGT 


9300 


G AT GG AT AG A 


CGTCATCTTC 


AAAAACCGCT 


TGAGGATATT 


TCTAGTAAAA 


TTAGGGACGA 


9360 


TATTTCCGGG 


CGTCACTACC 


AAAAACATGC 


CATTGCAAGC 


GTTTGTGAAG 


CTTTCTCTGA 


9420 


TCATCGTAGA 


CAGGCACTTT 


TGGTTATGGC 


AACTGGGGCG 


GGGAAAACTC 


GTACAGCAGT 


9480 


TTCTCTAGTT 


GATATCTTAT 


CACGTCATAA 


CTGGGTAAAA 


AACGTTCTCT 


TCTTAGCCGA 


9540 


TAGAACTTCC 


TTGGTTAAGC 


AAGCATATGA 


TTCGTTTAGA 


AAAT TACT CC 


CAGATCTTTC 


9600 


CGTTTGTAAC 


TTCTTAGAAG 


ATAAAGAAGG 


AGCTCAATCA 


AGTCGCATGG 


TCTTTTCAAC 


9660 


TTATCCGACC 


ATGATTGGAG 


CGATTAGTGG 


TCAAGAAGAA 


GTAAATCAAC 


GCCCTTTCAC 


9720 


TGTTGGGCAT 


TTTGACCTTA 


TCATAATTGA 


CGAATCTCAC 


CGTTCTATTT 


AT C AG AAAT A 


9780 


CAAGTCCATT 


TTTGATTATT 


TTGATGCAAG 


AATTGTAGGC 


TTAACAGCTA 


CTCCGCGTCA 


9840 


AGATTTAGAT 


AAAAACACCT 


ATGGATTCTT 


TAATTTGGAG 


AATGGGGTTC 


CAACATATGC 


9900 


ATATGATTTG 


GAAGAGGCTG 


TTAAAGACGG 


AT ATT T AGT A 


GCCTATCATT 


C T AT CGAAAC 


9960 


CAAACTGAAA 


CTACCTACGG 


ATGGTCTACA 


TTATGATGAT 


TTGTCCGAAG 


AAGAAAAGGA 


10020 
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ACATTTTGAT 


AGCAAATTTG 


AAGACAATAG 


CTGTGAAAAA 


GATATTGATG 


GGAGTGTATT 


10080 


TAATTCCTTT 


ATTTTCAATA 


AAAGTACAGT 


AGAAATTGTT 


TTAAATGAAC 


TCATGACAAG 


10140 


AGGAATTCAG 


ACAGCCTCGG 


GTGATGAAAT 


TGGTAAAACT 


ATTATTTTTG 


CTAAAAATCA 


10200 


TGATCATGCG 


GAATATATCA 


GAGGTATTTT 


TAACAACCGC 


TATCCTGAAA 


AAGGGAGCGA 


10260 


CTATGCTCAG 


GTGATTGATT 


ATAGTATTAA 


G C ATT AT C AG 


ACCTTGATTG 


ATGATTTTAA 


10320 


AATTAAGGAG 


AAGTATCCTC 


AAATTGCGAT 


TTCTGTCGAT 


ATGTTAGATA 


CAGGTATTGA 


10380 


TGT AC C AG AG 


GTTGTTAATT 


TAGTCTTCTT 


CAAGAAAGTA 


CGCTCTAAAA 


CTAAGTTTTG 


10440 


GCAGATGATT 


GGTCGAGGAA 


CCCGTCTATG 


TAAAGATTTA 


TTTGGACCTG 


AG C AGG AT AA 


10500 


GGAAAACTTC 


TTGGTATTTG 


ATTATGGGGA 


CAATTTTGAT 


TATTTTCGTG 


CAGATCCAAG 


10560 


AGATGGAGAG 


GGTCGTCACA 


TTGTTTCGCT 


GACTCAGCGT 


TTATTTAATA 


TCAAAGTGGA 


10620 


CTTGATTCGA 


GAACTTCAGG 


GACTCCAATA 


C C AAG AAG AT 


CAGTTTGCGA 


GAGCATACCG 


10680 


T C AG C AG C TT 


GTCTCGGAAC 


TTCAAGGTCG 


TATAGAGAGC 


TTAAATGAGT 


TGGACTTCAG 


10740 


GGTTCGTATG 


GT T T TAG AT A 


CAGTTTATAG 


C T AT AGG AAA 


TTGGAAAGTT 


GGCAGAATCT 


10800 


AACTGCTGTT 


ACAAGTGAAA 


CCATTCAAAA 


AAATCTCTCT 


CCGCTTTTAT 


TTGATGAAGA 


10860 


TAAAGAAGAT 


GAGATGGCGA 


GG AG AT T T G A 


TTTGTGGTTG 


CTTCATATTC 


AGTTGGGGCA 


10920 


ACTGACAGCT 


AAATCTTCCA 


CTGTTCATAT 


TTCCCAAGTG 


ATGAAGACGG 


CTAGAGCTCT 


10980 


TTCTGCTATT 


GGCAATATCC 


CGCAGGTTTT 


TGAGCAGGCT 


GAAATTATCA 


GGAAAGTACA 


11040 


GGAGCCTGAA 


TTTTGGAAAG 


AAGTTAACTT 


GT C T G ATT TG 


GAAAAAATTC 


GTCTTGCTAT 


11100 


TCGAGATTTA 


TTACAGTTTT 


TGGATAAAAC 


AGACCGTAAA 


CCCTACTATG 


TTAACTTTGA 


11160 


AGATCGTATA 


CTCTCCACTG 


TTCACGAGAC 


CACAGCATTT 


TTGCAGGTCA 


ACGATCTTCG 


11220 


GTCTTACAAT 


GAAAAAGTTG 


AGCATTATTT 


GAAAACTCAT 


CTGGATGAGG 


AGTCCATTTC 


11280 


TAAGCTATAC 


CATAATAAAA 


AGTTGACATC 


TGATGATATG 


CTTGCACTTG 


AAAAATTGCT 


11340 


TTGGGAAAAA 


TTAGGTAGTA 


AAG C AG ACT A 


CCAAAGTCAT 


TATGAAAATA 


AGGCAATTCC 


11400 


GAGATTGGTT 


CGTGAGATTA 


TTGGCTTAGA 


TAGAGAGTCT 


GCCAATCGTA 


TTTTTTCTAA 


11460 


ATTTTTGTCG 


GATGAGAATC 


TTAATGCCAG 


GCAGATTTCA 


TTTGTAAAAT 


TG AT T GT AG A 


11520 


CTACATTGTA 


GAAAATGGTT 


TTTTAGAGAC 


GAAAGTGTTA 


AC GC AAG AG C 


CGTTTAAATC 


11580 


TTATGGTTCT 


GTTCAACTAC 


TCTTCCAACA 


CCAACTACCA 


GTACTTCGTA 


ATATTGTTCA 


11640 


AATCATTGAA 


CTTATCAATA 


ATCGAGCTGG 


AGAAGCGGCT 


TAAATTCTAA 


AGTGATTGCC 


11700 


ATGCTGAGAC 


TCATTTAAAA 


TTAAAAAGAG 


TAGAAATTTA 


TGCTATATAT 


G AG AAGTTT T 


11760 
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ATTAGGAAGA ATGTCATCGT TTTCCTAGAA TACAGTATCA GTTGTTAAGT GG T T GAT AAA 1182 0 

TTTCAAAGTA GATACTTGTA CCACGATGTT TGTTGATCGA GTTATTAACA AAAGAGCTAC 11880 

TTTGATTTTA AAGAAATAGA AAACAAAAAG CCGAGCAAGA ATTCAATTGC AGGAGAAAAT 11940 

GAAATAATAC TCAATGAAAA TCAAAGAGCA AACTAGGAAA CTAGCTGCAG GCTGCTCAAA 12 000 

ACACTGTTTT GAGGTTGCAG ATGGAAGCTG ACGCGGATTG AAGAGATTTT C G AAGAGT AT 12 060 

AAATCTTCCT AGGATAAAGC AAAAC GC AT A GTATCAAGGG TTTTCAACAC TTGATACTAT 1212 0 

GCGTTTTCTG AT GTT AAAG A CTTTCTACCA GGTTTTTTAA AAGCATAATT GTTAGTTGTA 12180 

GTCATTTATT ATTCTTCAAA GAAAAATGGT GGGGCGAATT TTTTCAGTTC TTCAAAGCAC 12240 

TTTTGAGCAG TAT CTG CATC TTCACAGATG ATAAGACAGA CATCATTACC ACAAAGGGTA 12300 

GCGATAGCGT CAGGG AAGCT C AAAGT AT C A ATGATAGAAC CAAAGGATTG AGCCAGTCCA 123 60 

GGAAGGGTTT TTAGTAGGAC TTGGTGTTGA ACTGGGCGCA TCCAGACAAG GGCGTCTTCC 1242 0 

ATGTAGAGTT CGAGACGTTT TTCCCATTTT GAGATGGAAC CATTGTTAAG AACATAATAA 12 4 80 

GCGCTATCTT CTTCGCGGAC TTTTGATAGG TTCATATTTT TGATGTCGCG TGAGAGGGTT 12540 

GCCTGGGTTA CTTGAATGTC GTTCTCAGCA AGAAGGGCTT GCAACTCAGC CTGTGTATGA 12 60 0 

ATCTTGTTTT TTGTGATAAG AG CG CGT AT A AGTTGGTGGC GGTGTTCTGA TTTATTCATA 12 660 

ATAATGTAAC TCCTTTTAGC AAGGTAAGGT AAGCATGGAC TG AG C GAG G T CGACAGTCAA 1272 0 

GTGGTAGTCT GTATTGTCAC GGATGGTGAT TTCAAAGTCA GTAGTATAGA GGACTAAACG 127 80 

GAGAGTGTCT CCTTCTTTTA GCTTGTAAAT AGTTGGCTGC AGTTCAAATT GAACGTCCAT 12 840 

CCATTCATCT GCAGTAATAT CCTCTACTAA CAGTAAATCA TTTCTATTTT GTAAATTAAG 12 9 00 

GTAACCTTTT GTCACGACTC GTTGTGCCTC TGGTCTAAAT GGCAATTCAC AGAGATTTTC 12 9 60 

CAACATGTGA TAGCGACCGT TGTCAATGGT TCTAGCACTT AAAAT AG CTG GATAAGGTTG 13 020 

TAGGTATTTC TTTTGCCCAA ATTCTAGCAG TTGGGCAGAT AAGAGCCCCT TGTTTGTACT 13080 

GGATTTGATA CGAAGATTGA GCTGAGCGCG ACCGTTTAGG TGGAGATCTT TAGTCACAGG 1314 0 

AAGGTTAATA GTAATCTGAT TGGCTTTCCC TTG AT AG AG C TCTGTATTGA AGGTTTGGTA 132 00 

TGTCTTACCA TAGCGCTCAA AATCCTTATC TGGGTACTGG TTTTGAATAG CTTGCTCTTC 132 60 

TTGACCAAGT GAGAAGGTTT CACAGTTTTC TTGCCCACCG AAGTTATCAA GTGATAACCA 13320 

AGTCTGTGGA GCTGTATTGT CCTGCCAGAT AACAGTAGGA AGTTGAAAGT CTGTTTCCTG 13 380 

TCCTAGTAAT TTCTTGGTCA ATAAGGCATT T AT GG AC T C A CGGAAGTCAA TTGATTGCCA 13 440 

ATTGTTCATG TAAACATGGG C AC C ATT AT G GAAAAAGAGA TGCTTGTGTA TATGAGTAGG 13 500 

AAGAGCATGG AACATCTGGT AAACATGAAG TGGTTTGACA TTCCAATCCT GAGAACCATG 13 560 
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AGTAAAGACA ACCTCTGCCT TTACTTTATG GGCATTGAGC AGATAATTGC GGTCATGCCA 13 620 

AAACTGATTG TAGTCCCCAG TTTTTCGGTC TAGCTGAGCT TTCACTTTTT CTAAGTCAGC 13 6 80 

TTGGTGAGCT TCATTGCCAC G GAT AT AG T C GCCAGCTAAG AGATTACGAG AATAGGTTAA 13740 

CTCAGCAAGG GAGTCAAAGT CCTCACCTGG ATAACCACCT GGGCTAGTCA CCAGACCGTT 13 800 

TTCACGGTAG TAGTTGTACC ATGATGAAAT TCCTGCCTCG GCAATGATAA CTTCTAAACC 13 8 60 

ATCGACTCCT GTAGTCGCAA GACCATTGGA CATGGTACCT AGATAGGAAA GTCCTGTTGT 13 920 

AGCAACTTTT CCGTTTGACC AATCAGCCTT GACTTGACGC TGGCGCGTGT GATCAGTAAA 13 980 

GGCACGGCAA CGACCGTTAA GCCAATCGAT GACATTTTTA TAAGCCTCGA TTTGCTGGTA 14040 

GTCTCCATTA GTCATGAAAC CTGTCGAGTC TTTGGTACCA ACACCTGAGA CATAGAGATT 14100 

GGCAAAGCCT CTCGGAAGGA AGTAGTCGTT TAGTGTATAG CTAGAGTTGA TGTGAGTTAG 14160 

CTTTTCCTCA GCCTCTGCTA TAAGCTCAGC TTTACCTTGG GGTTGGACGA GATTTAGTTG 1422 0 

AGGTTTCTCT AGCTCAATCT TGTGAGGAAG CTTAACCTCA AGCTCGCCCT CCATCTTGTA 142 80 

GAGAGCCTTG TCACTAGCCT TGTCATTGGT TCCCTGATGA TAAGGGCTGG CTGTCATGAT 1434 0 

GGCAGGGATT TTTCCATCAA AACGAGGGCG AATAATGCTA ACCTTTACTA GGTCTGATAG 14400 

CCCTTTTTGG TCAGTATCGA CACGAGACTC AACGTAAACG ACTTCACGAA T G AC AT C CT G 144 60 

GTTAGAAAAA GTAGCCAAAC TCTTGCCGTT AAAGTAGTGG TAGTCATTAT CCTCCGGAAT 14520 

AAG AC C AT C A CTAACAAGTT GGTCGATAAG AGTATTTCCT TTTTTGGTGC GAGTATTGAG 14580 

TAACTGATAG AGATTTTCAA TCAAGTCACC ATATATAATG GGAAATCCAG TTTCTTTACG 14 640 

AAAAACGTCA CTATCTTCGA AGTCAACCAA ATAAGAAAAG CCTAAAAGTT GAAAAGCAAC 147 00 

AGTATAAAAA ATATCTGCTG TCAGTTCATC TTCTGATTGA AAAAATGTCA GCAGGTCTGT 147 6 0 

TTTTTTATCA GCTGCTAGGA TAGAAAGTGG GTAGTTGGTG TCTTGATAAG TGAAAAAGAA 1482 0 

ACGACGTAAA AAGGTTTCAA GTGAGTCTTT GTGATTGGCT GTATTTTGTA AATCAAAGCC 14 880 

ACATTTTTTT AGTTCAGATA AGACATTTTC TTTTGGAAAA TTGATATAAC TATATTGATT 14 940 

AAAACGCATA GAACCTCCAT ATAGAATGAC AGTTAAGGTT ATTATATCAA AAAAAAAGCA 15000 

GAAAGGGAAT TGTTAACTTC AAAAGGAAAT AATCCAATAA AAATGAATAA AGTACTAAAT 150 60 

TCAATATAGA GAACAGAGTA ACAATAAGAA TAAATAGATA GGGTATAAAA GTTCTAGGAG 1512 0 

ATTTATATTA TATGCTTTCT ATTTTTATAT ACAATATAGT ATAAATATAA AAATGATGAC 15180 

AAAAATACAA ATGAATAGAA AAT AAAT TAG TAAGCTGATG AAATTTTTCT CAAGAGAAGC 15240 

CATTTATAGG TGAAAATGGT ATAATATAGT GAGAAGGATA GAGGAGAAGT GTAAATTGAT 15300 
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CGCACAACTA GATACAAAAA CAGTCTATAG TTTTATGGAA AGCGTCATTT CGATCGAAAA 153 60 

GTATGTGAGA GCAGCTAAAG AATACGGCTA CACTCATTTG GCTATGATGG ATATTGACAA 15420 

TCTTTATGGC GCTTTCGACT TTCTAGAGAT TACAAAAAAA TACGGCATTC ATCCTTTGCT 154 80 

AGGGCTTGAA ATGACAGTGT TTGT AG AT G A TCAGGGAGTG AATTTGCGCT TTTTAGCTCT 15540 

ATCTAGTGTG GGCTATCAGC AGTTGATGAA GCTTTCGACA GCCAAGATGC AGGGGGAGAA 15 600 

AACTTGGTCA GTCCTGTCCC AGTACCTGGA GGATATCGCG GTCATTGTGC CTTATTTTGA 15660 

TAGAGTTGAG TCGTTAGAAC TAGGCTGTGA TTACTATATA GGGGTTTATC CAGAAACACT 15720 

AGCAAGCGAA TTTCATCATC CTATCTTACC TCTTTATCGG GTCAACGCTT TTGAAAGCAG 157 80 

GGATAGAGAA GTTCTTCAAG TTTTAACAGC GATTAAAGAA AATCTACCGC TCAGAGAAGT 15840 

TCCCTTGCGT TCGAGACAAG ATGTCTTTAT ATCAGCAAGT TCTTTAGAGA AACTATTCCA 15900 

AGAGCGTTTT CCGCAAGCTT TGGACAATTT AGAAAAGCTT ATTTCAGGCA TTTCTTACGA 15 9 60 

CTTGGATACT AGTCTGAAAC TGCCTCGTTT TAATCCAGCT AGACCAGCAG T AG AGG AG T T 16020 

GAGAGAGCGT GCTGAACTGG GGCTTGTTCA GAAGGGGTTG ACTAGTAAAG AATATCAAGA 1608 0 

TAGACTAGAC CAAGAATTGT CTGTTATTCA TGATATGGGC TTTGATGATT ATTTCTTGGT 16140 

TGTTTGGGAT TTGTTGCGTT TTGGACAATC GAATGGCTAT TATATGGGAA TGGGAAGGGG 162 00 

TTCTGCAGTA GGCAGTTTGG TTTCTTATGC CTT AG AC AT C ACGGGGATTG ACCCAGTAGA 16260 

GAAAAATCTG ATTTTTGAAC GCTTTCTTAA TCGTGAACGC TATACCATGC CTGATATTGA 1632 0 

TATTGATATC CCAGATATTT ATCGTCCAGA TTTTATCAGA TATGTTGGTA ATAAATATGG 163 80 

TAGTAAACAT GCGGCACAAA TCGTTACTTT TTCAACCTTT GGAGCCAAGC AAGCTCTTCG 16440 

AGATGTCTTG AAACGCTTTG GTGTGCCAGA GTATGAATTA TCTGCAATTA CTAAGAAAAT 16 500 

CAGTTTTCGT GACAATCTTA AGTCGGCCTA TGAGGGAAAT CTCCAGTTTC GTCAGCAAAT 16560 

CAATAGTAAG TTAGAATACC AAAAAGCTTT TGAGATTGCT TGCAAGATAG AGGGCTATCC 16620 

AAGGCAAACC TCTGTCCATG CGGCTGGTGT TGTAATTAGT GACCAAGATT TAACCAACTA 16 68 0 

CATTCCTCTA AAGTATGGTG ATGAAATTCC ACTGACTCAG TATGATGCTC ATGGAGTTGA 16740 

GGCTAGCGGA CTTTTGAAGA TGGACTTTCT GGGACTACGA AATTTGACCT TTGT C C AG AA 16800 

GATGCAAGAG TTGCTTGCTG AAACAGAAGG TATTCATCTG AAAATTGAAG AAATCGATTT 168 60 

AGAAGACAAA GAAACGTTAG CTTTATTTGC CTCTGGTAAT ACAAAAGGTA TCTTTCAATT 1692 0 

TGAGCAACCA GGTGCCATTC GTCTGCTTAA GCGTGTGCAA CCAGTCTGTT TTGAAGATGT 169 8 0 

CGTCGCGACT ACTTCTCTAA ATCGACCGGG TGCTAGTGAC TATATCAATA ATTTTGTGGC 17 040 

AAGAAAG CAT GGGCAGGAAG AAGTG AC TGT TCTGGATCCA GTACTGGAGG ATATTTTGGC 17100 
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TCCAACCTAC GGCATAATGC TCTATCAGGA GCAGGTTATG CAGGTTGCCC AGCGACTTGC 17160 

CGGATTTAGT CTTGGGAAAG CCGATATTTT GCGTCGGGCT ATGGGGAAAA AGGATGCCTC 1722 0 

TGCCATGCAT GAGATGAGGG CTTCCTTTAT TCAAGGTTCA TTAGAAGCTG GTCATACTGT 17280 

GGAAAAAGCA GAGCAGGTCT TTGATGTTAT GGAGAAGTTT GCAGGTTATG GTTTTAACAG 17340 

GTCACACGCC TATGCCTACT CAGCCTTGGC CTTCCAGTTG GCTTATTTCA AAACGCATTA 17400 

TCCAGCCATT TTTTATCAGG TCATGTTAAA TTCTTCCAAC AGTGATTACT TAATAGATGC 174 60 

ACT T G AAGCA GGTTTTGAAG TAGCCTCTCT ATCCATCAAC ACCATTCCCT ATCACGATAA 17 52 0 

AATTGCCAAC AAGGCCATCT ATCTAGGTTT GAAATCCATT AAAGGAGTCA GTAATGATTT 17580 

AGCTCTCTGG AT TAT TG AAA ATAGACCTTA TTCTAACATT GAAGATTTTA TAGCTAAATT 17 64 0 

ACCTGAGAAT TATCTGAAAC TTCCTCTGCT AGAACCTTTG GTAAAAGTTG GTCTTTTCGA 17700 

TTCATTTGAA AAAAATCGTC AAAAAGTATT TAATAACTTA GCTAATCTAT TTGAATTTGT 17760 

GAAAGAGTTG GGAAGTTTGT TTGGAGATGC TATTTATAGT TGGCAGGAAT CGGAAGATTG 17 82 0 

GACGGAACAA GAAAAATTTT ATATGGAACA AGAGCTTTTA GGGATAGGTG TCAGCAAACA 17 880 

TCCACTACAA GCTATTGCAA GTAAGGCTAT TTACCCGATT ACCCCAATCG GAAATTTGTC 17 94 0 

AGAAAATAGC TATGCTATTA TCTTGGTTGA AGTTCAGAAA ATAAAAGTGA TTCGTACCAA 180 00 

AAAGGGTGAA AATATGGCCT TCTTACAGGC AG AT G AT AGT AAGAAAAAAT TGGATGTCAC 180 60 

TCTCTTTTCA GACTTATATC GTCAGGTTGG ACAGGAAATA AAAGAGGGAG CCTTCTACTA 1812 0 

TGTAAAAGGA AAAATACAAT CACGTGATGG CCGTCTGCAA ATGATTGCAC AAGAAATAAG 18180 

AGAAGCAGTT GCTGAACGCT TTTGGATACA GGTGAAAAAT CATGAATCGG ATCAAGAAAT 18240 

TTCACGCATT TTAGAACAAT TTAAAGGCCC AATCCCAGTC ATCATCCGGT ATGAAGAGGA 18300 

ACAGAAAACC ATCGTTTCTC CCCATCATTT TGTAGCTAAA TCCAATGAAT TAGAGGAGAA 183 60 

ATTGAATGAA ATCGTTATGA AAACGATTTA TCGCTAAAAA TACGGAAAAT AGAAGAATTT 18420 

TCAACGTAAA TGTGGTATAA TCAGTAAGAA TGTTAAAAGA AAAAGGAGCA TAACCAATAT 18480 

GAAACGTATT GCTGTTTTGA CTAGTGGTGG AGACGCCCCT GGTATGAACG CTGCCATCCG 18 54 0 

TGCAGTTGTT CGTCAAGCAA TTTCAGAAGG AATGGAAGTT TTTGGTATCT ATGACGGATA 18 600 

TGCTGGTATG GTTGCCGGTG AAATTCATCC CCTAGATGCA GCTTCAGTAG GGGACATCAT 18 6 60 

TTCTCGTGGT GGTACTTTCC TTCACTCAGC TCGTTACCCA GAGTTCGCTC AACTTGAAGG 18720 

GCAACTTAAA GGGATTGAGC AATTGAAAAA ACACGGAATT GAAGGTGTAG TTGTTATCGG 187 80 

TGGTGACGGA TCTTACCACG GCGCTATGCG TTTGACTGAA CATGGCTTCC CAGCTATTGG 18 84 0 
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TCTTCCAGGT ACAATCGATA ACGATATCGT TGGTACTGAC TTTACAATCG GTTTTGACAC 18900 

AGCGGTTACT ACTGCCATGG ACGCTATCGA TAAGATTCGT GAT AC AT CAT CAAGTCACCG 189 60 

TCGTACTTTT GTAATCGAAG TTATGGGACG TAACGCTGGT GATATCGCTC TTTGGGCTGG 19 02 0 

TATTGCAACT GGTGCTGATG AAATCATCAT CCCTGAAGCA GGCTTCAAGA TGGAAGATAT 19080 

CGTAGCAAGC ATCAAAGCTG GTTATGAATG TGGTAAAAAA CACAATATTA TCGTCTTAGC 19140 

TGAAGGTGTG ATGTCAGCGG CTGAATTTGG TCAAAAACTT AAAGAAGCTG GAGATACAAG 192 0 0 

CGACCTTCGT GTAACAGAAC TTGGACATAT TCAACGTGGT GGTTCTCCAA CTGCGCGTGA 192 60 

CCGTGTTTTG GCGTCACGTA TGGGTGCACA TGCTGTTAAA CTTCTTAAAG AAGGTATCGG 19320 

TGGTGTTGCG GTTGGTATTC GTAACGAAAA AATGGTTGAA AATCCAATTC TTGGTACTGC 193 80 

AGAAGAAGGG GCATTGTTTA GCCTTACTGC AGAAGGTAAG ATTGTGGTTA ACAACCCAGC 1944 0 

TACAAA 1944 6 
(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16593 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

TCGTAAATAT GCTCTGTTTT TGGATTTTGT TTCTTAATCT GTTTGGCAAG TGCCTTCATC 6 0 

ATAGAAATAG GACCACACAT ATAGACGGTT GCATGTTCGG GCACTTCTTT TTGTTCAAAA 12 0 

TTAAGATAGC CGTCTTTCGT ACTGTCGATT AG ATGG AG TT CAAAATTAGG ATTTTTCTGA 180 

GCATAGTTAC GGAGTAAATC TAGGTAGACT GCATTTTCAT CTCCACGGAA GCTATAGTAG 24 0 

AAGTGAACCT GTTTATCTAA AATAGGATGT TCACGGATGT AAGAGATGAA GGGGGTGATC 3 00 

CCAATACCTC CAGCAATCCA AACCTGATTT TCTCGTCCTT CTTCTATGAT CATGTGTCCG 360 

TAAGCTCTGT CTAGGGTTAC TTTGCTGCCG GCTTGAAGAT TATCATAGAT ATTCTTGGTA 42 0 

TGGTCGCCTG AAGTTTTAAC AGTAAAGTAA AGAGTTTGAC CATGACCTCC TGAGATAGAA 480 

AAGGGATGCG GAGCACTTTC AAAGCCTTCT TGGAAAATCT TTAGAAAGGC AAATTGTCCT 54 0 

GATTGATAGT TGAAAGGTCT GCTAAGATGG ATTTGAATTT CTCTAGTATC GTGATTTAAG 600 

CGTTTGAGAT GGGTAATTTT C C C TAG AT AG GGGAAGGAAA TCTTTTGATA TAGAAAAATG 6 60 

ATATAAAAAC CAGCTAGTAA GCCTAAAAGG GCATAGCTAC CAACAAGAAA ACTTAGAAGA 72 0 

TTAAATGTAA GGAGACGATT GCCCATTATC ATGTAGATGT GAAAGAGTCC TAAAATATAG 780 
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GCTAGGTAAA 


CCAGGCGGTG 


AATCCATCGC 


CAAGCTTCGT 


ATTGGATGTA 


TTTGCCTAAA 


840 


TAGGCGACAA 


GGATGATGCT 


GGCAAAGATA 


TAGATGGCAA 


GATTGCCAAA 


CTGAGCAGCT 


900 


AAGC G AG AG C 


CCCACAAACC 


G C C CAT AC T A 


AAGTTATGAA 


AGATTAGTAG 


GATGATTGAG 


960 


AGAAAGGCTG 


TGAATTTGTG 


GACGGTGTAG 


ACCTTCTCCA 


AACTGTGAAA 


CCAGCTTTCT 


1020 


AGTAGTGGGA 


GACGAGTGGC 


TAGGATAAAA 


GTCAGAGATA 


GGCTTGTTAA 


AGCTAGTCCT 


1080 


GGAATCATGA 


ATTGGGGAGA 


AGTGTTCATC 


CAAGTCAAAA 


GAGTCAAGAT 


AAAACT AG CT 


1140 


ATGATAAAGA 


GTAGTCCTTT 


GACTGATTTC 


ATAGAAAATT 


CCATTTCATT 


TAGATTTCGA 


1200 


TTTGTTGTAA 


ATAAATTTGT 


TACATTTTAT 


CATAGAAAAT 


GTATGGTGTC 


AAATTGAGGT 


1260 


CTATAAATAT 


CTACTCTCAT 


CAAAAAACTC 


TCCAATTGAA 


CTGGAGAGTG 


GCTGTTTATA 


1320 


CTCAATGAAA 


AT C AAAGAG C 


AAACTAGGAA 


GCTAGCCGCA 


AGTTGCTCAA 


AACACTGTTT 


1380 


TGAGGTTGCA 


GATAGAGCTG 


ACGTGGTTTG 


AAGAGATTTT 


CGAAGAGTGT 


TATTCTGCAG 


1440 


CTTGTTGCCA 


ACGTTTGGCT 


AGC AT AT GAG 


ACAGGCTAGA 


AATTGCTAGG 


TTAAAGCTGA 


1500 


AGTAGATGAG 


GGCAATCAGG 


ATGTAAAGAC 


TGAAGACCTG 


CTCTGGTTCG 


AAATAACGGC 


1560 


CCATGAGAAT 


TTGGCTGGCT 


CCAAAGAGTT 


CTTGTAGGGC 


GATAACAGAG 


TAGAGGAGAC 


1620 


TGGTATCCTT 


AATCACGGTA 


ACAAACTGAG 


AAATGATGGC 


TGGTAGCATT 


TTGCGGATGG 


1680 


CTTGTGGGAG 


AATGATGTAG 


TAGAGGATTT 


GGGCTGAGGT 


GAAGCCTTGT 


GACATTCCTG 


1740 


CTTCGTACTG 


TCCCTTGTCT 


ACGGCATTGA 


GACCGCCTCG 


AATAATCTCA 


GCCAAGGCTG 


1800 


CTGATGTAAA 


GAGAGTAAAG 


GCTGTAATAC 


CTGCTGGTGT 


GGATTTCATT 


TTGAACACCA 


1860 


AAAAGATAGT 


AAAAATCCAG 


AGAAGGTTGG 


GAACGTTGCG 


CACAAACTCG 


ATATAAATAC 


1920 


TGGAAATAAT 


GCGTAAGACA 


GGATTTTTGC 


CATTTCTCGT 


G AC AG C T AGC 


AC CGT AC C G A 


1980 


TGATAGTAGA 


GAGGATGATG 


GCAATCAGAG 


AAATATAGAG 


GGTCAAGCCA 


AATCCTTTAA 


2040 


AGATAAAGAC 


TAGGTTATCT 


GGGGTTAAAA 


CTTCTAAAAT 


AGATTCCATA 


GTAACCTCCT 


2100 


AAAGTGAATA 


GGCTTTTTTG 


TTGGCTTGCT 


CCATCTTGCG 


ACCAAACTGG 


GCAACAGGGA 


2160 


AGCATAGAGC 


AAAGTAGAGA 


AGAGCAGCAC 


CTAAAAAGGC 


TGGTATATAG 


TTTCCGTTGA 


2220 


GAGCCGACCA 


AG ACT T AGT C 


AC AAAC AT C A 


AGTCTACTCC 


AGAGATGATA 


GCTACAGTAG 


2280 


AGGTGTTCTT 


GATGAGGTTA 


ACAATTTGGT 


TGGTCAATGG 


AGGGAGAATG 


ATGCGGAAGG 


2340 


CCTGAGGCAA 


GATAATCAAG 


CGCATGGCAC 


TGATATAGGT 


AAAACCTTGC 


GACAAGGCGG 


2400 


CCTCCATCTG 


AC C ACT AGG A 


ATAGACTGAA 


TCCCTGAACG 


AATAACCTCA 


GCGATATAAG 


2460 


CGCCGTGATA 


GAGTCCCACG 


CAGAGAACGG 


CTGTCCAATA 


AATTGGAATC 


ATGATGATAT 


2520 
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GGTCACTGAT AAGAGGTAGG CCATAAAAAA CAATAACAAA CTGCACCAAG AGGGGAGTAT 2 580 

TTTGGTAAAA TTCAACAAAG ATGCGAGCTA AAATGCGTAA AATTGGACGT TTACTGGTTG 2640 

AC AT GG C AC C AAAGAAGATG CCCAAAACCA TAGCGAGGAT AAAGGAACCA ACCGCTAGGG 2 70 0 

CAAGGGTGAA GAGGAAACCA TTGAAAAATT GTCCAAAATC CTGAAAATAG GCTGTCCAAG 2 7 60 

ATGATAAATC TGTCATGGGG TGTCCTCCTT AATCTGCAGT ATGG CT AG AT GGTTTGAGCT 282 0 

TGTAACGGTC ATAAAGTTTC TGCAAACTAC CATCCTTGCT CCATTTAGTA AC C AAGTT AT 2880 

CAAGATAGTC GTTGAGCTCT GTATTTGATT TCTTGGTAAC AATACCGTAG TCAGATGGCT 2 94 0 

TGAAACTATC ATCTAGTAGT GCTGTCCGTT TACTAGTGTA GC C AG AT AG A ATAGAGCGGT 3 000 

CAACGGAAAA GGT AT CG AT A CGATGAGCGT GCAGGGAAGT AATCAATTCT GGGTAGGAAC 3 0 60 

CAAGTTCGAC GAATTTAAAC TTCAGACCTT TCTTTTTACC CAGTTCAGTA ATCAGGCGTT 312 0 

GGGTGATAGA ACCTTGGGCG ACT CCG ATGG TTTTGCCGTT TAGGTCCTCA ATCTTTTTGA 3180 

TTTTGGCAGA TTTATTGACC AAAAATC C AG AAGCGTCTGT GTAGTAGGGA CTGGTAAAGT 3 24 0 

TGTAGAGTTT TTTGCGTTCG TCCGTGATGG TAAAGGTCGC GATATCCATA TCGACCTGTT 3 3 00 

CATTGTCTAG AAGGGGGCCG CGGGTTTGTG CTGTAACCGG CACATAGCGA ATCTTG AC C T 3 3 60 

TGAGTTCATC AGCTACCATC TTGGCCAAGT CGGTTTCGAT ACCAGAATAA GTACCGGTCT 3 42 0 

TGGG AT CTTT GTAACCAAAA TTGGGAACGT CTTGTTTGAC ACCGACAACC AGTTCGCCTC 3480 

TTTTTTGAAT GTCTGCGATA CTTGTATCAG CCTGGACTGG TTTGGCAGCA GCAAGGCCGA 3 54 0 

AAAGGCTAAT CAATAATGCT GATAAAAAGA ATTTTTTTTC ATAGGCGCCT CCTTATTTGA 3 600 

CTTTGTCACT TTCGTGGTTG ATAATTTTGC TGAGGAATTG TTGGGCACGA GGTTCGCTTG 3 6 60 

GATTGTCAAA AAAGTTATCG ACATCTGTCG TATCTACTAA AACTTCTCCG TCGGCCATAA 3 72 0 

AG AT AATG CG GTCCGCAACC TCTCGAGCAA AGCCCATTTC GTGGGTAACG ATGATCATGT 3 780 

TCATCCCATC ATGCGCCAGT TTCTGCATAA CTGCTAGAAC ATCTCCGATA GTCTCAGGAT 3 840 

CAAGAGCAGA TGTTGGTTCA T C AAAGAGG A GGAGTTCCGG ATGCATAGCA AGACCACGAG 3 9 00 

CGATGGCGAT CCGCTGTTTT TGTCCACCAG ATAGCATGGC GGGATAGGAA TCTTTCTTGT 3 9 60 

CCCACATATT TACAAATTCC AGATATTTTT GGGCGGTTTT TTCAGCTTCT TTTTTATCAA 4 02 0 

TTCCTAGAAC TTCAATGGGT GCAAGCGTTA CGTTTTCTAA CACAGCTTTG TGTGGATAAA 408 0 

GGTTAAAATG TTGAAAAACC ATGCCGACTT CCTTGCGAAG AGGT AC C AAA TCTTTCTGGC 4140 

TGGCACCAGC AACTTGGTGC CCATTGACTA GG AGACTTCC TTTGTCAACA GTCTCTAAAC 42 00 

CAT T G AT CGT ACGGATAAGA GTGGACTTCC C AG AGC C AG A AGGTCCAAGC AGGACAACAA 42 60 

CTTGTCCTTT TTCAAAACGG AGATTGATGT TGCGGAATGC GTGGTAGTCT CCGTAATATT 432 0 
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TTTCGACGTT TTTAAATTCT ACTAAAGCCA TG AG AG AT C T CTATTGTGTT ATATTTTATA 4380 

ACACGGTTCT ACAATAAAAG AATGTTCTTG TCAAATCATA TCTGAAAAAA TT C ACT AT AG 4440 

TGAAATAAGA ACAGGAAAAA TCGATCGGGA CAGTCAAATC GATTTCTAAC AATATTTTAG 45 00 

AAGTAGAGGT GTACTATTCT AGTTTCAATA TACTATAAAA TGTTATAAAA AAGCAATCTG 45 60 

GATAGAGAAA ACGTCTAAAT CATGTTATAA TGAAGCAATA GAATTCTTAG AAAGAGTGGA 4 62 0 

TGTCTTTTTG ATAACACCTA CTTATGAATG GCAGTTTGCC CTGCAGGTAG AAGATGCGGA 4680 

TTTTACAAAG ATAGCCAAGA AGGCTGGACT GGGTCCTGAG GTGGCTCGGT TATTGTTTGA 474 0 

GAGAGGGATT CAGAACCAAG AAAGTCTGAA GAAGTTTTTA GAACCTTCCT TGGAGGACTT 4 8 00 

AC ATG AT GCT TATCTGCTCC ATGATATGGA CAAGGCAGTG GAGCGGATTC GTCAGGCTAT 4860 

TGAAGAAGGG GAAAATATTC TTGTTTATGG AGACTATGAT GCGGATGGCA TGACTTCGGC 4 92 0 

TTCTATTGTG AAGGAAAGTT TGGAACAACT TGGTGCTGAG TGCCGAGTTT ACCTGCCAAA 4980 

TCGTTTTACC GAT GG C T ATG GCCCTAATGC TAGTGTTTAT AAATACTTTA TCGAGCAAGA 5 040 

AGGGATTTCC TTGATTGTGA CGGTGGACAA TGGGGTTGCT GGTCATGAGG CTATTGCATT 5100 

GGCTCAGTCT ATGGGAGTAG ATGTCATTGT GACAGACCAT CATTCCATGC CTGAAACCCT 5160 

GCCAGATGCT T ATG CT AT T G TCCATCCTGA ACATCCAGAT GCGGATTATC CTTTTAAATA 522 0 

TTTGGCTGGT TGTGGAGTTG CTTTCAAGTT GGCTTGTGCC CTGTTAGAAG AAGTGCAAGT 52 80 

GGAATTGCTT GATTTGGTCG CTATTGGAAC TATTGCAGAT ATGGTGAGTC TGACGGATGA 53 4 0 

AAATCGTATC TTAGTTCAAT ATGGTCTGGA AATGTTGGGT CATACCCAGC GCATTGGTCT 54 00 

GCAAGAAATG CTGGACATGG CTGGGATTGC TGCCAACGAA GTAACAGAAG AAACGGTTGG 54 6 0 

TTTCCAGATT GCTCCTCGTT TGAATGCCTT GGGTCGCTTG GATGATCCCA ATCCTGCCAT 5520 

TGATTTGTTG ACTGGATTTG ATGATGAGGA AG CG C ATG AG ATTGCCCTTA TGATTCACCA 55 8 0 

GAAAAACGAA GAGCGCAAGG AAATCGTTCA GTCTATCTAT GAAGAAGCCA AG AC C ATG G T 5 64 0 

GGATCCTGAG AAGAAGGTTC AGGTCTTGGC CAAGGAAGGC TGGAATCCTG GGGTTCTAGG 5700 

AATCGTGGCT GGTCGTTTAT TGGAAGAATT GGGACAGACA GTCATTGTTC TT AATAT AG A 57 60 

AGACGGTCGT GCCAAGGGCA GTGCTCGTAG TGTGGAAGCG GTCGATATTT TTGAAGCTCT 582 0 

GGATCCCCAT CGAGACCTCT TCATCGCCTT TGGAGGTCAT GCAGGTGCAG CGGGTATGAC 5880 

GCTGGAAGTT GAGCAACTCT CAGATTTATC TCAGGTTTTG GAAGATTATG TTCGTGAAAA 5 940 

AGGTGCAGAT GCTGGTGGCA AGAATAAGTT AAAC C TAG AT GAAGAGTTGG ATTTGGAGGC 6000 

ACTTAGCTTG GAAACGGTCA AAAGTTTTGA ACGTTTAGCT CCTTTTGGAA TGGATAATCA 6060 
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GAAACCTATT TTTTATATCA AGAATTTTCA GGTCGAAAGT GCTCGTACTA TGGGGGCAGG 612 0 

TAATGCCCAT CTAAAGCTGA AAATTTCCAA GGGTGAGGCG AGTTTTGAAG TGGTAGCCTT 6180 

TGGTCAAGGC AGATGGGCGA CAGAGTTTTC TCAAACCAAG AATCTAGAGT TAGCGGTTAA 62 4 0 

ATTGTCTGTC AACCAATGGA ATGGCCAAAC TGCCCTCCAG TTGATGATGG TGGATGCGCG 6300 

AGTGGAAGGT GTTCAACTTT TTAACATTCG TGGAAAAAAT GCAGTCTTGC CAGAAGGTGT 63 60 

TCCAGTCTTG GATTTTCCTG GAGAACTGCC AAATCTTGCG GCTAGTGAAG CTGTTGTCGT 642 0 

AAAAAACATT CCAGAGGATA TTACTCAGCT GAAGACCATT TTTCAGGAAC AGCATTTCTC 64 80 

TGCTGTCTAT TTCAAAAATG ATATTGACAA GGCTTATTAT CTGACAGGTT ATGGGACTAG 6540 

AGATCAGTTT GCCAAATTGT ACAAGACTAT TTACCAGTTC CCAGAGTTTG ATATTCGCTA 6 600 

CAAGCTGAAA GATTTGGCTG CATATCTTAA TATTCAACAA ATCTTGCTGG TCAAGATGAT 6660 

TCAAGTATTT G AAG AAC TAG GCTTTGTGAC GATAAAAGAT GGTGTGATGA CAGTCAATAA 672 0 

AGAGGCGCCA AAGCGGGAGA TAGGAGAAAG TCAAATTTAC CAAAATCTCA AACAAACCGT 67 80 

TAAAGACCAA GAAATGATGG CGCTGGGTAC GGTGCAAGAA ATTTATGATT TTTTGATGGA 6840 

AAAAGAGTAG AAGTTAGGAA AGAGTTGGGA AATCAACTCT TTTTTGAAAA CAGACCTTCA 6900 

TTTTGAAAAT CATCAAAAAA ATGGTATAAT GGTAGGAAAA GATTCGGCTG AAAGTATCAG 6960 

AACTTTTAGA ATAAGAGGGT AGAATTGCCC TATAATCAAG AT AAAC T AAG ATTTTGGAGG 702 0 

AAAAATGAGT AATATCAGTT TAACAACACT TGGTGGTGTG CGTGAGAATG GAAAAAATAT 70 8 0 

GTACATTGCT GAAATTGGAG AGTCCATTTT TGTTTTGAAT GTAGGGTTAA AATATCCTGA 7140 

AAATGAACAA TTAGGGGTCG ATGTGGTGAT TCCAAACATG GATTACCTTT TTGAAAATAG 72 00 

CG AC C GT ATT GCTGGGGTTT TCTTGACCCA CGGGCATGCG GATGCCATTG GTGCTCTACC 72 60 

GTATCTCTTG GCAGAGGCTA AAGTTCCTGT ATTTGGGTCT GAGTTGACCA TTGAGTTGGC 7320 

AAAGCTCTTT GTCAAAGGAA ATGATGCCGT TAAGAAATTT AATGATTTCC ATGTCATTGA 7 3 80 

TGAGAATACG GAGATTGATT TTGGTGGGAC AGTGGTTTCC TTCTTCCCTA CGACTTACTC 7440 

CGTTCCAGAG AGTCTGGGAA TTGTCTTGAA GACATCGGAA GGAAGCATCG TTTATACAGG 7 500 

TGACTTCAAA TTTGACCAAA CGGCTAGTGA ATCTTATGCA ACTGATTTTG CTCGTTTGGC 7 5 60 

AGAGATTGGT CGTGACGGCG TCCTGGCTCT CCTCAGTGAT TCGGCCAATG CAGACAGCAA 7 62 0 

TATTCAGGTG GCTAGTGAAA GTGAAGT T AG GGATGAAATT ACCCAAACTA TTGCTGACTG 7 680 

GGAAGGTCGT ATCATCGTTG CAGCTGTTTC CAGTAATCTT TCTCGTATTC AGCAGATTTT 7740 

TGACGCTGCG GATAAAACAG GTCGACGTAT CGTCTTGACA GGATTTGATA TTGAAAATAT 7 800 

CGTCCGCACA GCGATTCGTC TTAAGAAGTT GTCTTTAGCC AACGAAATTC TTTTGATTAA 7860 
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GCCTAAAGAT ATGTCTCGCT TTGAAGACCA TGAGTTGATT ATTCTTGAGA CAGGTCGTAT 7 92 0 

GGGTGAGCCT ATCAATGGAC TTCGTAAGAT GTCGATTGGT CGCCATCGTT ATGTAGAAAT 798 0 

CAAGGATGGG GACCTAGTCT ATATTGCTAC GGCTCCGTCT ATTGC T AAAG AAGCCTTTGT 804 0 

TGCGCGTGTG GAAAATATGA TTTATCAGGC AGGTGGGGTT GTCAAATTGA TTACCCAAAG 8100 

T T T AC AT GT A TCAGGGCACG GAAATGTGCG TGATTTGCAG CTGATGATCA ATCTTTTGCA 816 0 

ACCTAAGTAC CTCTTCCCTG TCCAAGGGGA GTATCGTGAG TTGGATGCTC ACGCTAAGGC 822 0 

TGCCATGGCA GTTGGGATGT TGCCAGAACG CATCTTCATT CCTAAAAAGG GG ACG AC CAT 8280 

GGCTTACGAG AATGGAGACT TTGTTCCAGC TGGATCGGTT TCAGCAGGAG ATATCTTGAT 83 4 0 

TGATGGGAAT GCCATTGGTG ATGTTGGAAA TGTTGTTCTT CGTGACCGTA AGGTCTTGTC 8400 

AGAGGATGGA ATTTTCATCG TGGCTATTAC AGTCAACCGT CGTGAGAAGA AAATTGTGGC 84 60 

TAGGGCTCGT GTTCACACGC GTGGATTTGT TTATCTCAAG AAGAGTCGCG ATATTCTCCG 8 52 0 

TGAAAGTTCA G AATTG AT T A ACCAAACGGT AGAAGAGTAT CTTCAAGGAG ATGACTTTGA 858 0 

CTGGGCAGAT CTCAAAGGTA AGGTTCGTGA CAATCTGACC AAGTACCTCT TTGATCAAAC 8 64 0 

CAAGCGTCGC CCAGCCATTT TACCAGTAGT CATGGAAGCA AAATAATCGT TGAAATAAAC 87 00 

AGAGAGAAAG TCGAGTTTCG GCTTTTTCTT ATAGAAAAAT AGAAGGAGAA AATCATGGCA 87 6 0 

GTGATGAAAA TCGAGTATTA CTCACAAGTA TTGGATATGG AGTGGGGGGT GAATGTCCTC 8 82 0 

TACCCTGATG CCAATCGAGT GGAAGAACCA GAGTGTGAAG ATATTCCCGT CTTGTACCTT 8880 

TTGCACGGGA TGTCTGGAAA TCATAATAGT TGGCTTAAGC GGACCAATGT AGAACGCTTG 8940 

CTTCGAGGAA CTAATCTCAT CGTTGTTATG CCCAATACCA GCAATGGTTG GTACACCGAT 9000 

ACCCAGTATG GTTTTGACTA CTACACGGCT CTAGCAGAGG AATTGCCACA GGTTCTGAAA 9 0 60 

CGCTTCTTCC CTAATATGAC GAGCAAGCGT GAAAAGACCT TTATCGCTGG TCTTTCTATG 912 0 

GGAGGCTACG GCTGCTTCAA ACTGGCTCTT ACGACAAATC GTTTTTCTCA TGCAGCTAGT 9180 

TTTTCAGGTG CCCTCAGCTT TCAAAACTTT TCTCCTGAAA GTCAAAATCT GGGAAGTCCA 92 4 0 

GCCTACTGGA GAGGTGTTTT TGGAGAGATT AGAGACTGGA CAACTAGTCC CTATTCTCTT 93 00 

GAAAGTCTGG CTAAAAAATC GGATAAAAAG ACCAAACTTT GGGCGTGGTG TGGCGAACAG 93 60 

GATTTCTTGT ACGAAGCCAA TAATCTCGCA GTGAAAAATC TCAAAAAACT AGGTTTTGAT 942 0 

GTGACCTATA GCCATAGCGC TGGAACTCAC GAGTGGTACT ACTGGGAAAA ACAATTGGAA 9480 

GTTTTTTTAA CAACCCTACC AATTGATTTC AAATTAGAAG AGAGACTGAC TTAGTTTGAA 954 0 

CTTCAGCATA GGGGGAGTAG AACTAAAATA AAATATGTTT TCACTAGACT TTTCAAACGm 9600 
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AAGTAGTAGA ATAGTAATAA AAT ACT GG AG G AAAG AG AG T AGGAAATGTA CCGTTATCAA 9660 
ATTGGCATTC CCACATTAGA ATATGATCAG TTTGTCAAAG AACATGAATT AGCCAATGTA 9 72 0 

TTACAAAGTA GTGCTTGGGA GGAAGTTAAG TCTAATTGGC AACATGAGAA GTTTGGTGTT 97 80 

TACAGGGAAG AAAAATT AC T GGCGACAGCT AGTATTTTGA TTAGAACTCT TCCGCTAGGC 9 840 

TATAAAATGT TTTACATCCC AAGAGGACCT ATATTGGATT ATGGGGATAA AGAACTCTTG 990 0 

AATTTTGCCA TTCAGTCTAT TAAGTCCTAT GCTCGCAGTA AGAGAGCGGT TTTTGTGACT 99 60 

TTTGACCCAA GTATTTGCCT ATCTCAAAGT TTAATCAATC AGGAAAAGAC AG AAT TT C C T 10020 

GAAAATCTGG CTATTATTGA TAGTTTGCAA CAAATGGGAG TAAGGTGGTC AGGAAAAACG 1008 0 

GAGGAAATGG GAGACACCAT TCAACCTCGT AT T C AG G CG A AAATATACAA GGAAAATTTT 10140 

GAAGAAGATA AACTTTCCAA GTCAACAAAA CAGGCTATTC GAACAGCACG AAACAAAGGG 10200 

CTTGAGATTC AATATGGTGG ACTGGAACTA TT AG ATT CAT TTTCGGAGTT GATGAAAAAA 10260 

ACTGAGAAGC GAAAAGAGAT TCATTTGAGG AATGAAGCCT ATTATAAAAA ATTG T TAG AT 10320 

AATTTTAAGG ACAAGGCCTA TATCACCTTG GCCACCTTGG ATGTTTCTAA ACGTTCGCAA 103 8 0 

GAGTTAGAAG AACAGTTAGC GAAAAATAGA GCCTTGGAAG AGACCTTTAC TG AGT C G ACT 1044 0 

CGAACTTCAA AAGTAGAAGC GCAGAAGAAG GAAAAAGAAC GTTTGTTAGA GG AATTG AC C 10500 

TTCTTGCAGG AATATATAGA TGTAGGTCAA GCGAGAGTTC CTTTAGCGGC TACTTTGAGT 105 60 

TTGGAATTTG GTACTACCTC TGT C AAT AT A TATGCTGGTA TGGATGATGA TTTTAAACGT 10620 

TACAATGCAC CAATTTTAAC TTGGTATGAA ACGGCTCGCT ATGCCTTTGA ACGAGGTATG 10 680 

ATCTGGCAAA ATTTAGGTGG TGTTGAAAAC TCTCTCAATG GTGGACTTTA TCATTTTAAG 1074 0 

GAAAAATTTA ATCCAACGAT TGAAGAATAC TTGGGTGAAT TTACAATGCC CACTCATCCT 10 80 0 

CTCTATCCTC TGTTAAGACT TGCTCTTGAT TTCCGTAAAA CATTAAGAAA AAAACATAGA 108 60 

AAGTAAGTAT ATGGCACTAA CAACACTCAC GAAAGAAGAG TTTCAGACTT ATTCTGATCA 109 20 

GGTTTCTTCT CGTTCCTTTA TGCAATCTGT CCAGATGGGG GATTTGCTAG AAAAAAGAGG 10980 

GGCTCGAATT GTTTATCTTG CTTTGAAACA AGAAGGAGAA ATTCAAGTTG CAGCTCTGGT 11040 

TTATAGCCTG CCCATGCTGG GTGGTCTGCA T AT GG AAC T C AATTCGGGGC CGATTTATAC 11100 

CCAACAAGAT GCTCTTCCAG TTTTTTATGC AGAGTTAAAA GAATATGCCA AGCAAAATGG 11160 

T GT AT TAG AG TTGCTTGTAA AACCCTATGA AACTTATCAA ACTTTTGATA GCCAAGGTAA 112 2 0 

T C C AAT AG AT GCTGAGAAAA AAAGT AT TAT TCAAGATTTG ACTGATTTAG GTTATCAATT 112 80 

TGATGGCTTA ACAACAGGTT ACCCAGGTGG AGAACCAGAT T GGTT AT AC T AT AAAG ATT T 11340 

AACTGAATTA ACTGAAAAGA GTTTGCTTAA AAGTTTTAGC AAAAAGGGTA AACCCTTGGT 114 00 
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G AAAAAGG C T 


GAAACCTTTG 


GCATTCGGTT 


G AAAAAG TT A 


AAACGTGAAG 


AACTATCGAT 


11460 


TTTTAAGAAT 


ATAACAAAAG 


AAACCTCTGA 


ACGTAGAGAA 


TATAGTGATA 


AAAGT T TAG A 


11520 


ATATTATGAG 


CATTTTTATG 


ATACTTTTGG 


AGAACAAGCG 


GAGTTTCTCA 


TAG C AAGC TT 


11580 


AAATTTTTCG 


G AC T AT ATG A 


G C AAATTG C A 


AGGTGAACAA 


AGTAAACTAG 


AAG AAAACT T 


11640 


GG AC AAGT TG 


CGACTTGATT 


TGAGTAAAAA 


TCCTCATTCT 


GAGAAAAAAC 


AAAATCAACT 


11700 


GAGAGAATAT 


TCTAGTCAAT 


TTGAAACGTT 


TGAAGTTCGA 


AAAGCAGAAG 


CGCGAGACTT 


11760 


GATTGAAAAA 


TATGGAGAAG 


AAGATATTGT 


TTTAGCTGGG 


AGTTTATTTG 


TTTATATGCC 


11820 


TCAGGAAACG 


ACTTATCTCT 


TTAGTGGTTC 


CTACACTGAG 


TTTAATAAGT 


TCTATGCCCC 


11880 


TGCACTGCTT 


CAAAAATATG 


TTATGTTGGA 


AAGCATAAAA 


CGTGGAATAC 


CTAAATACAA 


11940 


CTTCCTAGGC 


ATTCAAGGGA 


TTTTTGATGG 


AAGTGATGGT 


GTTTTGCGTT 


TTAAACAGAA 


12000 


TTTTAATGGC 


TATATTGTAC 


GCAAAGCAGG 


TACTTTCCGT 


TACCATCCAT 


CGCCTTTAAA 


12060 


ATACAAAGCT 


ATCCAGTTAC 


TCAAAAAAAT 


AGTAGGACGT 


T AAG AT G AAA 


AAGTCAGTAT 


12120 


TTAGATTTCT 


TTTAGCTTCT 


TTTAGTAAAA 


TAATTCTTAT 


TTG C TAG AAA 


GG T GG AG AG A 


12180 


CATGCGCTGG 


CTTTTTCGTT 


TGATAGGGGC 


TTTCTTTTCT 


TTTGTGTGGC 


GTTTGTTTTG 


12240 


GCGTCTGGTT 


TGGATAGTTG 


TGCTCTTATG 


TGTGCTTGCT 


TTCGGACTTC 


TCTGGTATCT 


12300 


GAACGGAGAT 


TTTCAAGGAG 


CGCTAAAGCA 


AGCAGAACGG 


TCAGTAAAAA 


TTGGTCAACA 


12360 


AAGT AT TG AC 


CAATGGGAGA 


AAACAGGGCA 


ACTGCCTAAG 


TTAAGCCAGA 


C AG AT AG T C A 


12420 


CCAGCATTCT 


GAAGGAAGGT 


GGGCACAGGC 


CTCTGCTCGT 


ATTTACCTGG 


AT C CG C AG AT 


12480 


GGATTCACGC 


TTTCAAGAGG 


C T T AT TT AG A 


AGCAATCCAG 


AACTGGAATC 


AAACTGGTGC 


12540 


TTTTAACTTT 


GAACTCGTGA 


CTGAGTCTAG 


TAAGGCGGAT 


AT T AC GGCT A 


CGGAGATGAA 


12600 


C G ACGG AGGC 


ACTCCTGTGG 


CAGGAGAGGC 


GGAAAGTCAA 


ACTAATCTCT 


TAACAGGGCA 


12660 


ATTCTTGTCC 


GTAACGGTGC 


GGTTGAATCA 


TTATTATTTG 


TCCAATCCAT 


ACTATGGCTA 


12720 


CTCCTATGAA 


CGCCTTGTCC 


AT ACGG C AG A 


ACATGAGTTA 


GGTCATGCGA 


TTGGCTTGGA 


12780 


CCATACAGAT 


GAGAAGTCTG 


TCATGCAACC 


AGCAGGTTCC 


TTTTATGGTA 


TCCAGGAAGA 


12840 


GGATGTTGCA 


AACCTCCGAA 


AAATATATGA 


GACTAGTGAG 


TAGGGTACTA 


TCTTTCCCTA 


12900 


CTTTTTTTGC 


TATAATGGAA 


CTATGAACAA 


CTTGATTAAA 


TCAAAACTAG 


AGCTCTTGCC 


12960 


GACCAGCCCT 


GGTTGCTACA 


TTCATAAGGA 


TAAAAATGGC 


ACCATTATCT 


ATGTAGGAAA 


13020 


GGCTAAAAAT 


CTGCGTAATC 


GAGTACGGTC 


CTATTTTCGT 


GGAAGTCATG 


AT AC C AAG AC 


13080 


AGAGGCTCTG 


GTGTCTGAAA 


TTGTGGATTT 


TGAATTTATT 


GTTACGGAGT 


CTAATATTGA 


13140 
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GGCACTTCTC CTAGAAATCA ACCTGATCAA GGAAAACAAG CCCAAGTACA ATATCATGCT 132 00 

CAAGGATGAC AAGTCCTATC CTTTCATCAA AATCACCAAT GAGCGCTATC CACGCTTGAT 132 60 

TATCACTCGT CAGGTCAAAA AGGACGGAGG TCTTTATTTT GGACCCTATC CCGATGTGGG 1332 0 

GGCAGCCAAT GAAATCAAGC GGTTGCTGGA TCGGATATTC CCTTTTCGTA AGTGTACCAA 13 3 80 

CCCGCCCTCT AAGGTCTGTT TT T ATT AC C A TATCGGCCAG TGTATGGCCC ACACCATCTG 13440 

TAAGAAGGAT GAGGCTTATT TCAAGTCTAT GGCCCAGGAG GTGTCTGATT TTCTGAAAGG 13 500 

TCAGGATGAC AAAATCATCG ATGATCTCAA GAGTAAAATG GCAGTAGCAG CACAAAGTAT 13 560 

GGAGTTTGAA CGTGCGGCGG AATACCGTGA CCTGATTCAG GCTATTGGAA CGCTTCGAAC 13 62 0 

CAAGCAACGG GTCATGGCGA AAGATTTGCA AAATCGCGAT GTCTTTGGCT ACTATGTGGA 13 680 

TAAGGGCTGG ATGTGTGTGC AGGTTTTCTT TGTCCGTCAG G t AAGCTCAT CGAGCGCGAT 13 74 0 

GTCAATCTCT TCCCCTACTT CAATGATCCA GATGAGGATT TTTTGACCTA TGTAGGACAA 13 800 

TTCTATCAAG AAAAATCTCA TCTAGTTCCC AATGAGGTAC TGATTCCGCA GATATTGACG 13 8 60 

AAGAAGCTGT CAAGGCTTTG GTGGATTCCA AGATTCTTAA GCCTCAACGT GGAGAGAAAA 13 920 

AACAACTGGT CAATCTAGCC ATAAAAAATG CTCGTGTTAG T C TAG AG C AG AAGTTCAATC 13 9 80 

TGCTAGAAAA ATCTGTCGAA AAGACTCAAG GAGCTATTGA AAATCTAGGG CGTTTGCTCC 14040 

AAATCCCGAC CCCAGTACGT ATCGAGTCCT TCGATAACTC T AAT AT CAT G GGAACTAGCC 14100 

CTGTTTCGGC TATGGTGGTC TTTGTCAACG GTAAACCGAG TAAGAAGGAT TACCGTAAGT 14160 

ACAAGATAAA AACGGTTGTT GGACCAGACG ACT AT GCC AG CATGAGAGAG GTCATTCGCA 1422 0 

GACGCTATGG TCGAGTACAG CGTGAGGCTT TGACTCCTCC AGATTTGATT GTGATTGATG 1428 0 

GGGGGCAAGG TCAAGTCAAT ATCGCTAAGC AGGTTATCCA AGAGGAACTG GGCTTGGATA 143 40 

TTCCAATTGC TGGGCTGCAA AAGAATGATA AGCACCAAAC CCATGAATTG CTCTTTGGAG 14400 

ATCCGCTTGA GGTGGTGGAT TTGTCTCGCA ATTCTCAGGA ATTTTTCCTC CTCCAACGCA 144 60 

TCCAAGATGA GGTGCACCGC TTTGCTATCA CTTTCCACCG CCAACTGCGC TCCAAAAATT 14520 

CTTTCTCATC TCAATTGGAT GGGATTGACG GTCTGGGACC TAAACGCAAG CAGAATCTTA 145 8 0 

TGAAGCATTT CAAGTCTTTG ACCAAAATCA AGGAAGC C AG TGTGGATGAG ATTGTCGAAG 14 640 

TTGGGGT AC C TAGAGTCGTT GCAGAGGCTG TGCAAAGAAA GTTGAACCCG CAGGGAGAAG 147 00 

CCTTGCCTCA AGTAGCAGAA GAAAGAGTAG ATTACCAAAC GGAAGGAAAC CACAATGAAC 147 6 0 

CATAAAATCG CAATTTTATC AGATGTTCAT GGCAATGCGA CGGCGCTAGA AGCAGTGATT 14 82 0 

GCAGATGCTA AAAATCAAGG GGCCAGTGAA TATTGGCTTC TGGGAGATAT TTTTCTTCCT 148 80 

GGTCCAGGCG CAAATGACTT AGTCGCCCTG CTAAAGGACC TTCCTATCAC AG C AAGTGT T 14 940 
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CGAGGCAATT GGGATGATCG TGTCCTTGAG GCTTTAGATG GGCAATATGG CTTAGAAGAC 15000 

CCACAGGAAG TTCAGCTCTT GCGTATGACA CAGTATTTGA TGGAGCGAAT GGATCCTGCA 150 60 

ACGATTGTCT GGCTACGAAG CTTGCCTTTG CTGGAAAAGA AAGAAATTGA CGGATTGCGC 1512 0 

TTTTCTATCT CTCATAATTT ACCTGACAAA AACTATGGTG GTGACTTGCT AGTTGAGAAT 15180 

GATACAGAGA AATTTGACCA AC TG C TAG AT GCGGAAACGG ACGTGGCAGT TT ATGGT CAT 152 4 0 

GTTCACAAGC AGTTGCTTCG TTATGGAAGT CAAGGGCAAC AAATCATCAA TCCAGGGTCG 15300 

ATTGGCATGC CCTATTTTAA TTGGGAGGCG TTAAAAAATC ACCGTTCCCA GTATGCCGTG 153 60 

ATAGAAGTTG AAGATGGGGA ATTACTCAAT ATCCAATTTC GTAAAGTTGC TTATGATTAC 1542 0 

GAAGCTGAGT TAGAATTGGC CAAGTCCAAG GGGCTTCCCT TTATCGAAAT GTATGAAGAA 15480 

CTGCGTCGTG ACGATAACTA TCAGGGGCAC AATCTGGAAT TATTAGCCAG CTTAATAGAA 15540 

AAGCATGGGT AT GT AG AGG A TGTGAAGAAT TTTTTTGATT TTTTGTAAGA GTTTCCTAAA 15600 

AT AG C C AATG CAAACTAAAA AAGCGATTTG CTGGTCCAAT CGCTTTTAGT ATATCTTATA 15660 

CTCAATGAAA ATCAAAGAGC AAACTAGGAA GCTAGCCGTA GGTTGCTCAA AG C AC AGCT T 15720 

TGAGGTTGCA GATAAAGCTG ACGTGGTTTG AAGAGATTTT CGAAGAGTGT TATTGTAACT 157 8 0 

GAGATTGATC TGGGAGGTAA GAACCACCTA GATAGGTATT GCTGAGTTTT TCAAGGGTTC 15840 

CGTCTTGATA GAGTTCTTTG AGCGCTTTAT CAAATTGCTC TTTAAACTCT TTTTGGTCGC 15 9 00 

T TG AG AAAAT GATATAATTG CTGGGGCTAT CTGCAGAAGG TAAATCAACG ACTGAGAGGT 15 9 60 

CTAAACCACG GTCCTTGATA ATCTTTTGAA CGGATACCTT GTCAAAAACT AGGAAATCAA 1602 0 

ACTCTCCGTT AGCAAGGTCT AGGATTCGTT TACCAATATC CTCACCAGAA AAATTAATTG 16 080 

TAGCGGGATT ATCAGTGTGT TTCTGATTCC AGTTATTGAT GAATTGAGCG TTAGAAGTTC 1614 0 

CGGTATCCTC TTGTGTTGTT TTACCAGCGA TCTGGTCAAG AGAAGTCAAA GGATTTTTCT 16200 

TGTTGCTGAC AAGGACGAGG GGATTGTTGG AAATTGGAAG CGAGTAAAGG TATTTTTCAG 162 60 

CACGCTCTTT TGTGTAACTC AAGTTATTGG CCGCAGCCTG ATAGTGACCA GAATCAAGTC 16320 

CTGGGAAGAT GCTCTCCCAG GCGGTTCTTT GGAATTGAAT CTCGTAGTCG CTGAGTTTTT 16380 

CATCTACTGC CTTTAAAACT TCGATATCAA AGCCTGTCAG ATTGCCCTTG TCTTCGTAGT 16440 

CAAATGGTGG CACGTCGCCA GCTGTAGCAA GGACGATTGT CTTTTGAGCG CTAGTCTCTT 16500 

TGGGTGTAGC TTGATTCTCA CAGGCAACCA AAAATGGTAG GATAGCTAGT AATAGGCTAA 165 60 

ATTTTTTCAT ACTGTCTCCA TTCAAATGTA AAG 16593 
(2) INFORMATION FOR SEQ ID NO : 53: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 510 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

GGGATATCCT TATATCCTTG TTCCTGGAAC CATTGTGGGA ATTGCTCAAC AGTTTTTTCA 6 0 

CCTTGAATTC CTGGTGCAAT GACAGTAAGA ATTTCGAAAT C AC G AT CTGG TTTCGCCGCT 12 0 

AGTTCCATCA ACTCTGGCAT ACTTTTCTTG C ATGG AC C AC ACCATGAAGC CCAAAACTTC 180 

AAGTAAACCT TTTTACCCTT AAAATCAGAT AACTTAACTT CTTTGCCATC CATGGATTGC 240 

AATGTGAAGT CTGGAGCATC TTTTCCAACA GCAATTTGTT GTACAGTCGT TTGTTGTTTT 3 00 

GGCTGTTGTG CTGCTTGAGT CTTTTTAGTT TCTTCCTCAC CACAGGCCAT CAATACAACT 3 60 

AATGACAAGA GACTTAAGCC AGCAAACATT ACTTTTTTCA TTTGTCCTCC TTTATTCAAA 420 

AATTCCAGCT AGAACATTTA CTTGTCCTAA TAGTAACAAA ATTCCCATTA AAACAATGAG 4 80 

GAAACCACCA ATTTTCTTTA GT AGC AT CAT ATGACGCTTG ATTTTACTAA AATATGGCAT 54 0 

GACTAGACCT GAAGCTAGTG CCAATACCAA GAAAGGAAGG GCCATGCCaG AGTGTAAATG 600 

AGAGTATAAA TCGCTCCTTG CCAAGCGCCA TTGCCTCCAG AAGCCGCAAG TGCTAAAACA 660 

GAACTTAAAA CTGGACCAAT ACAAGGTGTC CAACCAAAGC TAAAGGTAAT ACCAAGTAAA 72 0 

AAAGCTGACC AATAACGATT AGAATCTGAT TTTTTAAAGG TAAAACTTTT TTGAACTTCT 780 

AATTTCTTCA AATGAAAAAT TTCCATCTGG TGAAGACCCA AAATGATAAT AATAGCTCCC 84 0 

ATGCCATATC , GAAACCAATT T G CAT AG AG A ATATGACCAA AGTAACCAGC ACCAAAGCCT 9 00 

AGAATAAAGA AAATGAGAGA GATACCAGCG AT AAAG C AAA GTGTTCGAAT CAAGCCTGAC 960 

CAGAGAACCT TTCTCCCAAA CAAAGAAAAG CTTTTTGCAC TTTCTTGATC AT C C AAT AAA 102 0 

AT C CC AGC AT AGACTGGCAG AAGAGGAAAA ATACAAGGAG AAAAAAAGGA TAAAACACCT 1080 

GCTAGAAAAA CAGAGATTAA AAATACTATC GTTTCCAATA AAGAACCAAC TTTCTTAATA 114 0 

ATTCTAATCC TATTTTACTA TATTCAATTT TATTTGTAAG CTTTCTGCTA CGCAAAATCG 12 00 

TATCGGGCAC TATTGGACCA ATCTTTTCTT TTGCTAGTCA AGGCGGATCT TATCCCCCAA 12 60 

AATAGCCAAA AAGCAACGAC AAGGATTACT CATCGCTGCT TTTGTGAACG AAAATGTCTT 132 0 

TTAGGTCTGA CATTTCATAA ATCATGTTTT ACTTGAGTTT GTCAAGGATT GCTTTAAGCT 13 8 0 

CCTCTACTAG TTTAGTTTCT GTCTCTGCTG AGCCATTTTC TTCTTTCACG AAATCAAGGG 14 40 

TTTCTTGGAG AAGGTTTTGG GCTTTGGCAA GGACTTTTTT ATCCGCTTTT TCTGCATCTA 1500 
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GCTGTCCTAG AACCTTGATC AATTCCGTGC TTAATTGCTG GATTTCTGAC TCTTTCTTAC 1560 

GGCGAATCAG CCAGAAGGCA ATCACGCCTA GGAGGGCAAG TAGACTGACC ACAATCACTC 162 0 

CTGCCGGAAC TGAGTTTGTT TCAGTCATCT TATCTGAATC CTTACTATCT TCCGTTCCTT 1680 

GTTTTGCATC CTTCTTGTCC TGTGCAGGCT TGCTGTCGCT AGCATTTGCT TTCACATCTT 1740 

TGAGAGAGTC CAAGGCAGCC CAGCCTTCAC AGACTCTACT GCAGTATGCA G AC C TT ACT C 1800 

TGTCAAGGCA CTATCTTCCG GAGCTTTTTG AGCATCTAGG AGGACAGCCT TGGTTGCATC 18 60 

GATTTTCGGA TCAGATACTG TTGCCAAAGC TTTCAAGCGT TGGTCTAACT CTTGACTCAA 192 0 

GGCACGAAGT TCAGACTTGT CAACTTGCTC TTGAGCTTGT GTGCTCGTTG AG CT AG C CG A 1980 

AGCGCTTGCT ACCACTCTAG GATCTTGAGT CGGAGCTGAG CTTGGAGCTG GGACAGGGCT 2 04 0 

TGCAGGTTGA CTAGGAACAG TTATGGTATA TTGAAACTAG AATAGTACAT ATGGACTTCT 2100 

AAAACATTGT TAGAATTCGA TTTTACTGTC CTGATCGATT TGTCCTATTC TTATTTCATT 2160 

T T ACT AT AAT AACCGATGGT GTGGTTAATG TTGGTAAGAG AAACTTCTGA AACCAAGCTT 2 22 0 

CAAAAAAGTC GCTCGTCATC GTCTCTTCGT AAGTCATTGG AGCGATTAAT TCACCATTTG 2 2 80 

TTAGACCTGC AACCAAAGAA ATCCTCTGAT ATCTTCTTCC AGATACTTTG CCTCTTATTA 2 34 0 

ACTGACCTTT TAATGAGCGA CCATATTCTC GATAAAAATA AGTATCGAAT CCTGTTTCGT 2 4 00 

CAATCTAAAC AGGTGCTAGG TGCTTTAAAC TATTAAAATT CTTAAGAAAT AAGGCTACTT 2 460 

TTTCTGGGTC TTGTTCATAG TAGGTGTGGT TCTTTTTTTC GAGTGTAGCC CATAGCTTTG 252 0 

AGCGCATAGT GGATGGTAGT TGGATGACAG CCAAAkTCAG AAGCTATTTC AGTCAAATAA 2580 

GCrTCTGGAT TGTCAGTAAG ATAGTTTTTA AGTCTATCTC TATCAACTTT TCTTGGTTTT 264 0 

GTTCCTTTTA CTTGGTGGTT TAGCTCTCCT GTTTTCTCTT TTAGCTTTAA CCAGCCATAA 2 700 

ATGGTATTAC GTGAGATTTG GAAAACGTGT GATGCTTCTG T TAT ACT AC C TATTCGCTCA 2 7 60 

CAATAAGAGA GAACTTTTTT ACGAAAATCT ATTGAATATG CCATAAGAAG ATTATACCAC 282 0 

ATTGTGTACT ATTTTTGGTT CATTTCACTA TAACACAAAA TAGATTATTA TTACATAACA 2 880 

AAAAAGAGGT CTAAACCTCT TAACTCAATT ACTCCGCCAG T AGG ACT C G A ACCTACGACA 2 940 

TCATGATTAA CAGTCATGCG C TACT AC C AA CTGAGCTATG GCGGATTAAA GCTAAGCGAC 3000 

TTCCCTATCT C AC AGGGGG C AACCCCCAAC TACTTCCGGC GTTCTAGGGC TTAACTTCTG 3060 

TGTTCGGCAT GGGTACAGGT GTATCTCCTA GGCTATCGTC ACTTAACTCT GAGTAATACC 312 0 

TACTCAAAAT TG AAT AT C T A TTCAATTTAA GAAAACCGTT CGCTTTCATA TTCTCAGTTA 3180 

CTTTGGATAA GTCCTCGAGC TATTAGTATT AGTCCGCTAC ATGTGTCGCC ACACTTCCAC 3240 
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TTCTAACCTA TCTACCTGAT CATCTCTCAG GGCTCTTACT GATATATAAT CATGGGAAAT 33 00 

CTCATCTTGA GGTGGkTtCA C ACT T AG ATG CTTTCAGCGT TTATCCCTTC CCTACATAGC 3 3 60 

TACCCAGCGA TGCCTTTGGC AAGACAACTG GTACACCAGC GGTAAGTCCA CTCTGGTCCT 342 0 

C T CGTAC TAG GAGCAGATCC TCTCAAATTT CCTACGCCCG CGACGGATAG GGACCGAACT 3480 



GTCTCACGAC GTTCTGAACC CAGCTCGCGT 
(2) INFORMATION FOR SEQ ID NO: 54: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20986 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 



CGGAGAAAAA 


CATGGCTAAG 


TCAAACTTTG 


AAAAAGTAGA 


ATCAGTTGTT 


GGCTGGGTTC 


60 


GTGATAAGAA 


AATCACAGGC 


TACCGTATCT 


CTAAAGAAAC 


GAATGCGCGT 


GAAATGTCTA 


120 


TCATTGCTCT 


GGCGCAGGGT 


CGTGCAAAAG 


TAAAAAATAT 


TTCATTTGAA 


ACAGCCCTAG 


180 


GCCTAATTGA 


TTTCTATGAA 


AAAAATTATG 


AAAAATTTGA 


AGATTAATCT 


TTGGATAACG 


240 


GCGGATTCTT 


GACCTTCAAG 


TAGTAGAGAT 


AGAGAATCTG 


CCTTTTCATT 


TTGAGGACAG 


300 


CAAAAAGACT 


GCACGGTTGA 


TGCAGCCTTT 


TCTTTTTATT 


TGAGATAGCG 


TTGAAGGAAC 


360 


TCTTTTGTTC 


GGTCTTCTTT 


AGGATTGGTG 


AAGAGGTCTT 


CTGGTTTACC 


TTCTTCAGCG 


420 


ATCACGCCCT 


TAT C CAT AAA 


GATAACACGG 


TGAGAGACAT 


CACGGGCAAA 


TTCCATTTCA 


480 


TGGGTTACGA 


CAATCATGGT 


CAAGCCTTCC 


TGAGCCAGGT 


C C TG C ATG AT 


TTTGAGGACT 


540 


TCTCCAACCA 


TTTCTGGATC 


GAGAGCTGAT 


GTTGGTTCAT 


CAAAGAGAAT 


AGCGTCCGGA 


600 


TTCATGGAGA 


GGGCACGAGC 


GATGGCCACA 


CGTTGTTTTT 


GACCACCTGA 


GAGTTGTTTT 


660 


GGTTTGGCTT 


GCCAGTAGCG 


TTCTCCCATG 


CCGACCTTTT 


CCAGGTTTTC 


TTTGGCAATC 


720 


TTTTCAGCTT 


CTGTGCGTTC 


GCGTTTTAGG 


ACAGTTGTCT 


GAGCGACGAT 


TGTGTTTTCA 


780 


AG AAC AT T G A 


GATTT T C AAA 


GAGGTTAAAG 


GATTGGAAAA 


CCATCCCCAA 


CTTTTCACGG 


840 


TATTGCGTGA 


GGTCATAGCC 


TTTTTCGAGG 


ACGTTTTGTC 


CATGATAAAG 


GATTTGTCCA 


900 


TCAGTTGGTG 


TTTCAAGTAG 


GTTAATGGAG 


CGTAGGAAGG 


TCGATTTTCC 


GCTTCCAGAG 


960 


CTTCCGATGA 


TAGAGATGAC 


CTCTCCCTTG 


TGGACAGTGA 


GTGAAATGTC 


TTTTAGCACT 


1020 


TCGTTTTGTC 


CATAGGATTT 


TTTGAGGTGT 


TTAATTTCAA 


GGATTGCTTG 


TGTCATTATT 


1080 


TCAAATCCTC 


CGTTTGCATT 


TGGTTAGCAC 


CTGTAGTGTA 


GGTATCCATG 


TCCATTCTGC 


1140 



WO 98/18931 



PCT7US97/19588 



479 



GCTCGATAAA 


GCGTAGGATA 


CGTGTTACGG 


TGAAGGTGAG 


GACAAAGTAA 


ATCACGGCGA 


1200 


I GATTGTAAA 


TGTCTGGAAG 


TATTGATAGG 


TTTGTGTTGC 


CACGGTATTT 


CCTGAGAAAT 


1260 


AAAGTTCGAC 


AACAGAGATA 


ACGTTCAATA 


C AG ATGT AT C 


TTTGATATTG 


ATGACAAATT 


1320 


CATTACCAGT 


TGCAGGTAGG 


ATGTTACGGA 


CTACCTGAGG 


T AGG AC AAT C 


TTACGCATGG 


1380 


TCTGGTTATG 


GGTCATACCA 


AGAGCAGTCG 


CAGCTTCAAA 


TTGTCCCTTG 


TCAACTGCTA 


1440 


GGATACCACC 


ACGGACGATT 


TCAGTCATGT 


AGGCACCGGT 


AT TG AT T G AA 


ACGATGAAGA 


1500 


TAGCAGCCAG 


TGTACGGTCA 


AGGTTGATCC 


CGAAAGCTTG 


GGCAGTTCCA 


TAGTAGATAA 


1560 


CCATCGATTG 


AACAATCATT 


GGCGTACCAC 


GGAAAATTTC 


AATGTAGACA 


T TG AG AAC C C 


1620 


AGCCGACTAG 


TTTTTGTAGG 


CCGTAAATGA 


CTTTGTTTTC 


AGAGAGAGGA 


GCAGTACGGA 


1680 


AGACACCAAT 


GGCAAGTCCA 


ATAATGAGAC 


CTATGATGGT 


TCCGACGATA 


GAGATTAAAA 


1740 


G AGTG AT AC C 


AGCACCACGC 


AAGAGTTGTT 


GCCAGTTTTC 


AGAAAGAATT 


TTAGCAACTT 


1800 


GGCTAAAGAA 


ACTACTGCTA 


GTCTCTTCAG 


TTGTTGTAGC 


TTCGGCAGGT 


TGTTCCTTGA 


1860 


TCATACGATC 


CATCAAGGCA 


ACTTGGTCAT 


CTTTTGAAAT 


GGTTTCAATG 


CTGGCATTGA 


1920 


TTTGGCTAAT 


ACGATTGTCA 


TTTTTACGAA 


GCCCGATAGC 


GATAGCTGTA 


TCTTCTTCCC 


1980 


CAGTTTTGAA 


ACCAGGTTCT 


ACTTG AAT C A 


TCTTGAACTT 


AGAGTTCGCA 


GCTTCAGCAG 


2040 


TCAGTGCTTC 


TGGACGTTCA 


GAAACATAAG 


CATC AAT G AC 


ACCAGCCTCA 


AGAGCTTGTC 


2100 


GCATTTGAGC 


GAAGTCTCCC 


ATGGCTGTTT 


CTTTTTTAGC 


ACCTGGGATT 


TGTGCAATCA 


2160 


AGTTATAAAG 


GTAGACCCCT 


TGTTGAGAAG 


TGATTTTTGC 


ACCGTTAAAG 


TCATCCAAAG 


2220 


AT T TAG C ACT 


TGCGTAGGCA 


GAATCTTTTT 


TGACAAGCAA 


AACTGGTTCG 


CTAGTATAGT 


2280 


AACTGCTCGA 


AAAGGCAATT 


TCTTGTTTGC 


GTTCTGCAGT 


TGGACTCATA 


CCTGCGATAA 


2340 


T C ATGT C AAT 


CTTACCAGAA 


GTAAGGGCAG 


GGACTAGACC 


TTCCCACTTG 


GTTTTAACAA 


2400 


CCAAAGGTTC 


TTTACCTAAG 


TCCTTAGCGA 


TTTTCTTGGC 


GATTTGAACA 


TCGTATCCGT 


2460 


TGGCATACTG 


ATTGGTCCCA 


TCGATTTTGA 


CAGCTCCGTT 


GCTATCATCA 


TCCTGGGTCC 


2520 


AGTTAAAGGG 


AGCATATGCT 


GCTTCCATAC 


CGATGCGTAA 


ATATTCATCG 


GCTTGAGCAA 


2580 


CATTGACAAG 


TCCTAGCATC 


AGCAAGAGAC 


TTGTGAAAAT 


AGATAAGTAy ATGTGGCTCA 


2640 


TGATTTCTCC 


TATTCTGATC 


TATTAAAAAA 


TAACTGTCTC 


CTATTTTATC 


GAAAAATGCG 


2700 


TAATTTTTCA 


ACATAAGTAA 


GTCTTTACTT 


ACGAAAAAAT 


GCTATAATGA 


TAAGAAAGAT 


2760 


AAAAAGGGGG 


CTTAGTTGAT 


GAAAAAAACT 


TTTTTCTTAC 


TGGTGTTAGG 


CTTGTTTTGC 


2820 


CTTCTTCCAC 


TCTCTGTTTT 


TGCCATTGAT 


TTCAAGATAA 


ACTCTTATCA 


AGGGGATTTG 


2880 
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TATATTCATG CAGACAATAC GGCAGAGTTT AGACAGAAGA TAGTTTACCA GTTTGAGGAG 2940 

GACTTTAAGG GCCAAATCGT GGGACTTGGA CGTGCTGGTA AGATGCCTAG CGGGT TTG AC 3 000 

ATTGACCCTC ATCCAAAGAT TCAGGCCGCG AAAAACGGTG CAGAACTAGC AG ATGT G ACT 30 60 

AGCGAAGTAA CAGAAGAAGC GGATGGTTAT ACTGTGAGAG TCTATAATCC AGGTCAGGAG 312 0 

GGCGACATAG TTGAAGTTGA CCTCGTCTGG AACTTAAAAA ATTTACTTTT CCTTTATGAT 3180 

GATATCGCTG AATTAAATTG GCAACCTCTG ACAGATAGTT CAGAGTCTAT TGAAAAGTTT 3240 

GAATTTCATG TAAGGGGAGA CAAGGGGGCT GAAAAACTCT TTTTCCATAC AGGGAAACTT 3 3 00 

TTTAGAGAGG GAACGATTGA AAAGAGTAAC CTTGATTATA CTATCCGTTT AGACAATCTT 3 3 60 

CCGGCTAAGC GTGGAGTTGA GTTGCATGCC TATTGGCCTC GGACCGATTT TGCTAGCGCT 342 0 

AGGGATCAGG GATTGAAAGG GAATCGTTTA GAAGAGTTTA ATAAGATAGA AGACTCGATT 3480 

GTTAGAGAAA AAGATCAGAG TAAACAACTC GTTACTTGGG TCCTCCCTTC GATCCTTTCC 3 540 

ATCTCCTTGT TATTGAGTGT CTGCTTCTAT TTTATTTATA GAAGAAAGAC CACTCCTTCA 3 600 

GTCAAATATG CCAAAAATCA TCGTCTCTAT GAACCACCAA TGG AAT TAG A GCCTATGGTT 3 660 

TTATCAGAAG CAGTCTACTC GACCTCCTTG GAGGAAGTGA GTCCCTTGGT CAAGGGAGCT 3 72 0 

GGAAAATTCA CCTTTGATCA ACTTATTCAA GCTACCTTGC TAGATGTGAT AGACCGTGGG 3 7 80 

AATGTCTCTA TCATTTCAGA AGGAGATGCA GTTGGTTTGA GGCTAGTAAA AGAAGATGGT 3 84 0 

TTGTCAAGCT TTGAGAAAGA CTGCCTAAAT CTAGCTTTTT CAGGTAAAAA AGAAGAAACT 3 900 

CTTTCCAATT TGTTTGCGGA TTACAAGGTA TCTGATAGTC TTTATCGTAG AGCCAAAGTT 3 9 60 

TCTGATGAAA AACGGATTCA AGCAAGAGGG CTTCAACTCA AATCTTCTTT TGAAGAGGTA 4 02 0 

TTGAACCAGA TGCAAGAAGG AGTGAGAAAA CGAGTTTCCT TCTGGGGGCT C C C AG AT TAT 4 0 80 

TATCGTCCTT TAACTGGTGG GGAAAAGGCC TTGCAAGTGG GTATGGGTGC CTTGACTATC 414 0 

CTGCCCCTAT TTATCGGATT TGGTTTGTTC TTGTACAGTT TAGACGTTCA TGGCTATCTT 42 00 

TACCTCCCTT TGCCAATACT TGGTTTTCTA GGGTTAGTTT TGTCTGTTTT CTATTATTGG 42 60 

AAGCTTCGAC TAGATAATCG TGATGGTGTT CTAAATGAAG CGGGAGCTGA GGTCTACTAT 432 0 

CTCTGGACCA GTTTTGAAAA TATGTTGCGT GAG ATT G C AC GATTGGATCA GGCTGAACTG 4380 

GAAAGTATTG TGGTCTGGAA TCGCCTCTTG GTCTATGCGA CCTTATTTGG CTATGCGGAC 444 0 

AAGGTTAGTC ATTTGATGAA GGTTCATCAG ATTCAAGTGG AAAATCCAGA TATCAATCTC 4 500 

TATGTAGCTT ATGGCTGGCA CAGTACGTTT TATCATTCAA CAGCACAAAT GAGCCATTAT 45 60 

GCTAGTGTCG CAAATACAGC AAGCACCTAC TCTGTATCTT CTGGAAGTGG AAGTTCTGGT 462 0 

GGTGGCTTCT CTGGAGGCGG AGGTGGCGGC AGTATCGGTG CCTTTTAAAG AGAGCTACCA 4 680 



WO 98/18931 



PCT/US97/19588 



481 

TAGACTGAAA AAGTATGATA TAATGGAAGA TAGAAAAAAG ACAAACTATA AGAAAAGTCA 4 74 0 

ATAGTTTTAT CTAAACTATT TCTTATTTCA ATTTGATGAT TTGGCGATGA TTTTAGAGCA 4800 

CGGCAAAAAG CCCTTGAAAA AGTCCATTTT TTCAAAGGTA ATCCTGTGTT AATTTCAGAA 4860 

ATTACATCAC TTTTTGTTCG TCAAATGGCA GCTCTTTTTT AGGATATAAA ACAGGGTTCG 4 920 

GATAAGTTTT TTTGCAAGGT GGATGATGGC TACATTGTAA TGTTTTCCTT ATTCTAACTT 4980 

AGTCTTAAGA TAGGCCTTAG AAGCAGGTGA AAAGCGAGGG CATGCTTTGG C AG CTTGT AT 5040 

GAGTGCCCAC CGCAGATGAG GGGAACCCCG TTTGACCATT CTTCCAGCTA AATCAATCTG 5100 

ACCTGACTGA TAAATAGAAG AATCCAGTCC AGCGAAAGCT TGTAATTGAG CAGGATTATC 5160 

AAAGGCATGA ATATTTCGAA TCTCGGCTAA AATGACCGCC CTAAACGATC CCCAATCCCA 52 2 0 

GTAACCGTCG TGATGACCGA GTTGAACTCA GCCATCGAGT CATTGATACA TGTTTCCGCC 52 80 

TTGTCAATGA GCCTCTTGTA ATGCTTGATG ATTTCGAATT CACGAGCAGG AGATGTTGTT 534 0 

CCGATAGAAC GAGGTGCGAC TGAGAGGATA TCCTGAATTT TAGAAGCGGT CAATCGCTTA 54 00 

ATTTCTATCA GCTTATCAAA TCCTGCCTCA ATCCTTTTCT GAGGATTAGG GTAGCGTGTC 5460 

AAGAGTTGGT AGGTATATTC TGAATGCTTT CCAACGATTT TATCCAACTC AGGAAAGATG 552 0 

AT AT C AAG AC AACGAGTGTA TTGTACTTTC CAATCAGACT GTTTTTCTTG AGACGATGAA 5580 

TATGTCTAGC CAGTATTTTT AGGTCTACTT G C CG AT T AT C GTGTTGAAAT TGTTCACGAT 5 640 

TGGGGTCAGA AAGAAGTTTA AGAGCGATGC CATGAGCGTC TTTCTTATCC GTTTTAGTCT 5700 

TGCGAAGTGA TAATGATTTG GCAAATTCCT TGATGAGCAA AGGATTGTAG GTGTAAACTT 5760 

TATATCCTTG TTCATGCAGG AAGTTCAGTA GATTAAAGGC AT AATGT CCA GTATCTTCAA 5820 

GAGCGATGAG ACAGTCTTGG TTGATCTGTC GAATAGACAG ATCTAAGAGT TCAAAACCAG 5880 

CT T T ATT ATT TGAAAAAGTG AGTGGTTTAA GAACAGTTTT TCCTGGAACA TTCAAGGCTG 5 940 

TAACATCGTG TTTATTTTTA GCGATATCAA TGCCTACATA AAGCATGGGA GTACCTCC£G 6000 

ATATAGTATT TCAAGTCTAC TTGGTT AT CC ACGAATTTTT TGCCTTGTTA CCTTAGACGA 6060 

GATCAAACGT C T ATGCGTT A TCAAACTCAT TACCAATTGA AACAAAAGCT GTGGTTAGAG 6120 

CCTTTCGGAA ATCGTCAAGC GATTGGAGGA AATGAACTAA TCCATAGTGG CTTATTCCAA 6180 

GTATACCACT TGGGCTTTGG CAGTAGCTAA CTGCGCTAAA TATAATATAG GGAGTAATCT 6240 

ATGTATCTTA TTGAAATTTT AAAATCTATC TTCTTCGGAA TTGTTGAAGG AATTACGGAA 6300 

TGGTTGCCGA TTTCCAGTAC AGGTCACTTG ATTTTAGCAG AGGAATTCAT CCAATACCAA 63 60 

AATCAAAATG AAGCCTTTAT GTCCATGTTT AATGTCGTGA TTCAGCTTGG TGCTATTTTA 6420 
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GCAGTTATGG 


TG AT T TAT T T 


TAACAAGCTC 


AATCCTTTTA 


AACCGACCAA 


GGACAAACAG 


6480 


GAAGTTCGTA 


AGACTTGGAG 


ACTATGGTTG 


AAGGTCTTGA 


TTGCTACTTT 


ACCTTTACTT 


6540 


GGTGTCTTTA 


AATTTG AT G A 


TTGGTTTGAT 


ACCCACTTCC 


ATAACATGGT 


TTCAGTTGCT 


6600 


CTCATGTTGA 


TTATCTACGG 


GGTTGCCTTC 


ATCTATTTGG 


AAAAGCGCAA 


TAAAGCGCGT 


6660 


GCTATCGAGC 


CAAGTGTAAC 


AGAGTTGGAC 


AAGCTTCCTT 


ATACGACCGC 


TTTCTATATC 


6720 


GGACTCTTCC 


AAGTTCTTGC 


TCTTTTACCA 


GGGACTAGCC 


GTTCAGGTGC 


AACGATTGTC 


6780 


GGTGGTTTGT 


TAAATGGAAC 


CAGTCGTTCA 


GTTGTGACAG 


AATTTACCTT 


CTATCTTGGG 


6840 


ATTCCTGTTA 


TGTTTGGAGC 


TAGTGCCTTA 


AAGATTTTCA 


AATTTGTGAA 


AGCCGGAGAA 


6900 


CTCTTGAGCT 


TTGGGCAATT 


GTTTTTGCTC 


TTGGTCGCGA 


TGGGAGTAGC 


TTTTGCGGTC 


6960 


AGCATGGTGG 


CTATTCGCTT 


CTTGACCAGC 


TATGTGAAAA 


AACACGACTT 


CACCCTTTTT 


7020 


GGTAAATACC 


GTATCGTGCT 


TGGTAGTGTT 


TTGCTACTTT 


ACAGTTTTGT 


CCGTTTATTT 


7080 


GTATAAGAAA 


AACCTTGAAG 


GGGCAACTCT 


TCAAGGTTTT 


ATACTCTTCG 


AAAATCTCTT 


7140 


CAAACCGCGT 


CAGCTTTATC 


TGCAACCTCA 


AAACAGTGTT 


TTGAGCAGCn 


CTGCGGCTAG 


7200 


CCTCCTAGTT 


TGCTCTTTGA 


TTTTCATTGA 


GCTTTAAAAT 


CCAGTCATGG 


TAATCCCCAA 


7260 


TAGGCGGACA 


CCTCTTTCTT 


TCTTGCTTAA 


TTCTTCATAG 


AGTTGCAGGG 


CTATTTGGCT 


7320 


TATCTGACTA 


GCATCTTGTG 


TTTTTTGAGC 


AAGACTTTTT 


CGTTTGGTAA 


GAGTTGAAAA 


7380 


GTCCTCGTAG 


CGGATTTTCA 


AAATGACAAT 


TTTTCCAGCT 


TTTTCTTGTT 


GAT GT AG ATT 


7440 


GAGAGCGACT 


TTTTCTGATA 


GAAGAGTCAG 


CTCTTTTTTG 


ATATCTTCCT 


CAGCAAGGAG 


7500 


AATCTTCCCG 


TAGGTTTTCT 


CCTTGCCGAT 


TGATTTACGG 


ATGCGATTGG 


ATTTGACTGG 


7560 


AGAGTTGTGA 


ATGCCACGAG 


CCTTTCGATA 


CAGATCATAG 


CCTAGTCTAC 


CAAAACGGTC 


7620 


TATTAGGGTT 


ACCTCAGGAA 


CTTCAAGTAA 


AT C AG C ACC A 


GTAAAAACGC 


CCATTTGATG 


7680 


AAGACGTTCT 


ACTGTCTTTT 


TTCCTACTCC 


ATGAAATTTG 


GAAATATCCA 


TTTGTTTGAG 


7740 


AAAATCCTCA 


GCCTGTTCAG 


GTAGAATCAC 


TGTCAAACCA 


TGTGGTTTTT 


GATAATCACT 


7800 


CGCCATTTTA 


GCTAAGAATT 


TGTTGTAAGA 


AACGCCTGCG 


GAAGCAGTTA 


GATGGAGTTC 


7860 


TTGCCAGATA 


TCTTTTTGAA 


TGAGGCGAGC 


AATTTTGACC 


GCTGACTTGA 


TACCGAGTTT 


7920 


ATTTTCTGTC 


ACATCCAAAT 


AGGCTTCGTC 


AATGCTCATG 


GGTTCAATCA 


AATCTGTATA 


7980 


GCGCTTAAAA 


ATAGCTCGAA 


TCTGGAGTCC 


C AC AG ACT TG 


TATTTCTCAT 


AATTCCCTGA 


8040 


GATAAAGACA 


GCCTGGGGAC 


AACGTTCATA 


AGCTTCCTTG 


GAACTCATGG 


CAGAATGGAC 


8100 


ACCAAAAGCT 


CTTGCCTCAT 


AACTACAGGT 


AGAAACGACT 


CCCCGTCCAC 


CTGTTTGCCG 


8160 


AGGGTCGCTT 


CCAATAATGA 


CAGGTTTTCC 


TCTGAGTTTA 


GG ATT AT C C C 


TG AT TT C C AC 


8220 
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TGCAGCAAAA AAGGCATCCA TGTCAATATG GATGATTTTT CTTGACAAAT CATTTAACAA 82 8 0 

AGGAAAAATC AACATGCCTA GCACCTTTTT ATACTCTTCG AAAATCTCTT CAAACCACGT 8340 

CAGCTCTATm TGCAACCTCA AAACAGTGTT TTGAGCAATC TGCGGCTAGC TTCCTAGTTT 8400 

GCTTTTCGAT TT C CAT T GAG TGTTACTGCT TATTyTCTTT TATTATACCC TTTTTTCTGA 8460 

AAAAAAGAAA AAAGGACTTT ATTTTTTCAA AAATATAATA CAGTTTGAAA TAAAATATAG 852 0 

ACTGTTTTAG AAAAGAAAGT GTAAAAATAG GGAATTTTCA CTTGTTGAAA TCGGTTACTA 85 8 0 

TATGGTATAC TTGTCTTATG AATGTAACAG ATGACTGTTA CTAGAAAAAA GAGGACATTA 8 64 0 

ATATGGTTGT TAAGACAGTT GTTGAAGCAC AAGATATTTT TGACAAAGCT TGGGAAGGCT 87 00 

TCAAAGGCGT AGAT TGG AAA GAAAAAGCAA GTGTATCACG ATTTGTACAA GCTAACTACA 87 60 

CACCTTATGA TGGAGACGAA AGCTTCCTTG CAGGACCAAC AGAGCGTTCA CTTCACATCA 8 82 0 

AG AAAATT GT AGAAGAAACT AAAGCACACT ACGAAGAAAC TCGTTTCCCA ATGGACACTC 8880 

GTCCAACATC TATCGCTGAT ATCCCTGCTG GATTTATCGA CAAAGAAAAT GAAGTTATCT 8940 

TCGGTATCCA AAACGATGAA CTCTTCAAAT TGAACTTCAT GCCAAAAGGT GGTATCCGTA 9 000 

TGGCTGAAAC TACTTTGAAA GAAAATGGAT ACGAACCAGA CCCAGCTGTT CACGAAATCT 9060 

TCACTAAATA TGTAACAACA GTTAACGACG GTATTTTCCG TGCCTACACT TCAAATATTC 912 0 

GTCGCGCTCG TCACGCACAC ACTGTAACTG GTCTTCCAGA TGCATACTCA CGCGGACGTA 9180 

TCATCGGTGT TTACGCACGT CTTGCTCTTT ACGGTGCAGA CTACTTGATG CAAGAAAAAG 9240 

TAAATGACTG GAATGCAATC AAAGAAATCG ATGAAGAAAC AATCCGTCTT CGTGAAGAAG 9 3 00 

TAAACCTTCA ATACCAAGCA TTGCAACAAG TTGTTCGCCT GGGTGACCTT TACGGGGTTG 9 3 60 

ATGTTCGCAA ACCAGCGATG AACGTGAAAG AAGCAATCCA ATGGGTTAAC ATTGCTTTCA 942 0 

TGGCTGTCTG CCGTGTGATT AACGGTGCTG CTACATCTCT AGGTCGTGTA CCAATCGTAT 94 80 

TGGACATCTT TGCAGAACGT GACCTTGCTC GTGGTACATT T ACT G AATC A GAAATCCAAG 9 54 0 

AATTCGTTGA TGATTTCGTT ATGAAACTTC GTACAGTTAA ATTTGCTCGT ACAAAAGCTT 9 600 

ATGACCAATT GTACTCAGGT GACCCAACCT TT AT C AC AAC TTCTATGGCT GGTATGGGTA 9 660 

ACGACGGTCG TCACCGTGTT ACTAAGATGG ACTACCGTTT CTTGAACACT CTTGACAACA 972 0 

TCGGTAACTC ACCAGAACCA AACTTGACAG TTCTTTGGAC TGACAAATTG CCATACAACT 97 80 

TCCGTCGCTA CTGTATGCAC ATGAGCCACA AACACTCTTC TATCCAATAC GAAGGTGTAA 9840 

CAACAATGGC TAAAGACGGA TATGGTGAAA TGAGCTGTAT CTCATGCTGT GTGTCTCCAC 9900 

TTGATCCAGA AAATGAAGAA CAACGCCACA ACATCCAGTA CTTCGGTGCT CGTGTAAACG 9960 
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TTCTTAAAGC 


CCTTCTTACT 


GGTTTGAATG 


GTGGTTACGA 


CGATGTTCAC 


AAAG AC TAC A 


10020 


AAGTATTTGA 


TATCGAACCA 


ATCCGTGACG 


AAGTTCTTGA 


ATTTGAATCA 


GTTAAAGCGA 


10080 


ACTTTGAAAA 


ATCTCTTGAC 


TGGTTGACTG 


ACACTTACGT 


AGATGCCTTG 


AACATCATCC 


10140 


ACTACATGAC 


TGATAGGTAC 


AACTACGAAG 


CTGTTCAAAT 


GGCCTTCTTG 


CCAACTAAAC 


10200 


AACGTGCCAA 


CATGGGATTC 


GGTATCTGTG 


GATTTGCTAA 


CACTGTTGAT 


ACATTGTCAG 


10260 


CTATCAAATA 


CGCTACAGTT 


AAACCAATCC 


GTGACGAAGA 


TGGCTACATC 


TACGATTACG 


10320 


AAACAATCGG 


TG ACT AC CCA 


CGCTGGGGTG 


AAGATGACCC 


ACGTTCAAAC 


GAATTGGCAG 


10380 


AATGGTTGAT 


CGAAGCTTAC 


ACAACTCGTC 


TACGTAGCCA 


CAAACTATAC 


AAAGACGCAG 


10440 


AAGCTACAGT 


ATCACTTTTG 


ACAATCACAT 


CTAACGTTGC 


TTACTCTAAA 


CAAACTGGTA 


10500 


ACTCACCAGT 


TCACAAAGGT 


GTATACCTCA 


ACGAAGATGG 


TTCTGTGAAC 


TTGTCTAAAC 


10560 


TTGAATTCTT 


CTCACCAGGT 


GCTAACCCAT 


CTAACAAAGC 


TAAAGGTGGT 


TGGTTGCAAA 


10620 


ACTTGAACTC 


ACTTTCTAGC 


CTTGACTTTA 


GTTATGCAGC 


TGACGGTATC 


TCATTGACTA 


10680 


CACAAGTATC 


ACCTCGCGCT 


CTTGGTAAGA 


CTC GTGATG A 


AC AAGTT G AT 


AACTTGGTAA 


10740 


CAATTCTTGA 


TGGTTACTTC 


GAAAACGGTG 


G ACAAC AC G T 


TAACTTGAAC 


GTTATGGACT 


10800 


TGAACGATGT 


TTACGAAAAA 


ATCATGTCAG 


GCGAAGACGT 


TATCGTACGT 


ATCTCTGGAT 


10860 


ACTGTGTAAA 


CACTAAATAC 


CTCACTCCAG 


AACAAAAAAC 


TGAATTGACA 


CAACGTGTCT 


10920 


TCCACGAAGT 


TCTTTCAATG 


GATGACGCCT 


TGGATGCATT 


GAGCTAATCA 


AGTTCTTGAA 


10980 


TAATAAAAAG 


GAACCCTCGG 


TCAAACGACT 


GAGGGTTTTG 


TGCTTGGGAT 


AGTATGAGCA 


11040 


ATTCCTTCGG 


CGCAATATGC 


AATGTTTTTG 


GGCTCTTTGT 


CAACTGTAGT 


GGGTTGAAAA 


11100 


AAAGCTAAGC 


TTGAGAAAGG 


ACAAATTTCG 


TCCTTTCTTT 


TTTGATGTTC 


AGGGCGATAA 


11160 


AAATCCGTTT 


TTTGAAGTTT 


TCAAAGTTCC 


GAAAACCAAA 


GGCATTGCGC 


TTGATGTCTT 


11220 


TGATGAGTTT 


GTTAGTGGCC 


TCAAGTTTAG 


CGTTAGAATA 


AGGCAATTCA 


ATGGCGTTAG 


11280 


TGATGTAGTT 


TTTATAGCAA 


ATAAATGTGC 


TCAAAGTGGT 


TTTAAAGGTG 


CGGTTGAGAT 


11340 


GAGGTAACGT 


GTCTTGAATT 


AAGCCCCAAA 


ACTGGTCAGT 


ATTCTTCTCT 


TGTAGATGAA 


11400 


ATAGGAGTAG 


TTGATACAGG 


TCATAGTAAT 


CTTTAAGTTC 


AGGTACTAGA 


GT AAAG ATT T 


11460 


TCTTCAGACA 


CTCCCTAGGA 


GTTAAGGTCT 


CTCTGAAAGT 


TCTAGCATAG 


AAAGGCTTAA 


11520 


GAGAGAGTTT 


CCGACTATCT 


TTTAGGATAA 


ATTTCCAGTA 


ATATTTAAGA 


GCTCTGTATT 


11580 


C C AG AG AT TT 


ATCATCAAAT 


TGCTTCATGA 


TGTTGATTCT 


AGTCTGATTA 


AGAGCCCTGC 


11640 


TCATGTGTTG 


GACAATGTGG 


AAACGATCGA 


GAACAATTTT 


AGCATTGGGA 


AATAATTTCT 


11700 


TAATGAGAGG 


G AT ATAACT T 


C C AG AC AT AT 


CAACAGTGAC 


GACTTTAACT 


TTTTTTCTAG 


11760 
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CTTCTTTCGA 


GTACTTGAAG 


AAATGATTTC 


GGATGGTTGT 


TTGACGTCTG 


TTATCAAGAA 


11820 


TGGTCATGAT 


TTTCTTAGTG 


TTGAAATCCT 


GAGCAATGAA 


AGCCAATTTC 


CCCTTCTGGT 


11880 


AGGAGAATTC 


ATCCCAGGAG 


AGGATTTCAG 


GCAAAGTGGT 


GTAATCCTCT 


TGGAAATGAA 


11940 


ATTGCTTGAG 


CTTACGATAG 


ACGGTAGAGG 


TAGAGGTAGA 


GGTAGAGATG 


GCTAATTTAG 


12000 


AAGCGATATG 


TGTAAGAGCC 


TCTCTGTTGA 


GTAGGAGTTG 


GGCAATTTTC 


TGTCTCACCA 


12060 


TT T C CG AG AT 


TTGGCAATTT 


TTCTGAACGA 


GAGTTGTTTC 


AG C T AC AGTG 


ACTTTCCGAC 


12120 


AGGACTTGCA 


TTGAAATCGT 


CTCTTTTTCA 


AATGAATGAG 


GCT AGG G AAA 


CCACCAATCT 


12180 


CGATAAAAGG 


GATTTTAGAA 


GGCTTTTGGA 


AGTCGTATTT 


GATTTGTTTT 


CCTTTACAGT 


12240 


GTTTACATTT 


AGGTGGGTGA 


TAATCAAGTG 


TAGCGAAGAC 


TTCGATATGG 


GTATCGTGCT 


12300 


GAATGGCTTT 


ATTTAAGGTG 


ATGTTTTTGT 


CTTTTATTCC 


GATGAGTAAT 


GTGGTATGAT 


12360 


TGATGTGTTC 


CATAAGATAC 


TTTCTAATGA 


GTTGTTTAGG 


CGCTTTTCAT 


TATAAGTCTT 


12420 


ATGGGACTTT 


TTTGATACTC 


AAAAAGCCCT 


ATAATCTCCA 


CAGTGGGATT 


TACCCACTAC 


12480 


AGAAATTATA 


G AG CC AG AAA 


AAACACTTTT 


GT T C AC TAG C 


AGAAACTAGA 


GAGCAGAAGT 


12540 


GTTTTTCTGT 


TCAGATTTAC 


CCAAAACTGG 


GAAATATGGG 


GATAAGAATA 


GAGATGGCTT 


12600 


AGGAAGCCCC 


TTTTTGTGTG 


TAGACAGTAC 


GATGAACTTA 


TAACAAATAG 


TGAGCCTTTT 


12660 


TAGCAATCAT 


TGCGACCCGT 


TTGTCAAAAG 


CCTCTTTTCG 


GATATCTACA 


ATTGTCTGAT 


12720 


AGATGAGACG 


CTGTTGGCTA 


ACATGCAAAT 


CTAAGGCAAT 


CGTCAAAAAG 


TGATGTTTCC 


12780 


CTTTGGGATA 


CTGCTTTTTA 


ACGTAAGGCA 


GGTATTCTTT 


CGTTGTAATA 


ATAATCAATG 


12840 


GCTCTGTCAA 


ATGCTCCTCT 


GAAGGAGGAG 


GACTAATTAG 


AATATTGTAT 


CCTGTAACAG 


12900 


AGGCAACTTT 


GTCAGTAAAA 


TTCCGTAAAA 


TAATGGACTT 


TATTAAGTTT 


ACATCTGCTT 


12960 


G ATT ATT T AA 


AATGATAAAA 


ATCGGGATAG 


CAGGTAGTGA 


GGAAAAGATG 


GTTTCTGTCA 


13020 


AGTAGAGTGA 


GAAAAGGTAC 


AGCCGATGCT 


GGTCGATAAC 


TCCTTCAATC 


TTCTGCTCAG 


13080 


TCATCCACTC 


TTGAACAATT 


GCTTTCGAAA 


TATGATACAG 


TGGCTTGTCG 


CTTTCAATCC 


13140 


CATAATGTTC 


GTAATAATTA 


TAATAGGGAA 


CTAGATTTTG 


TAAACCAAAC 


AAAAACGTTC 


13200 


TTGTTAAGAA 


AGTCAGTGCT 


GTTAAAAAAG 


AAAGAGAATT 


CGAAATGTCA 


TTTCCTAAGA 


13260 


TATTCTTGAA 


CTTGGATAGT 


AGATGCTTTC 


CTCTTGTATG 


CTGAAGAATC 


AGTTGAATAG 


13320 


TATGAGTCTT 


TTTTTCTTGA 


TTCCATTTGT 


CCTTGGAAAA 


CGAAGAATTA 


GCAGAACAAT 


13380 


AAACCAAAAA 


GATATAATCC 


AGTTCTTCCT 


GAGTAAAAGT 


CATGTTGGCA 


TGTGGCTCTA 


13440 


AGTAAGTTTG 


GCAATGTTCC 


ATCAAAATCG 


GATACATAAA 


GAGGTTTTTT 


AATTTTTCAA 


13500 
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ACTCTTTGGA CTCAGGGAAC TCAAGTGGAA ATTCCCGACG TTTCCAAGTG AGTGCCACTA 13 5 60 

GTATGCTAAA AT GAAC AT AC TCGTCAGGTG TGATTTCTAA CAGTTCATGA CTGAGTTGAG 13 62 0 

AATT AG AC TG CACAATCATA TGTGTGACCC AAT C CAT ACT TCCATCATTC AAAT CAT AAA 13 680 

TCTCAATACC AAAATGAAAC TGGAGGAGTG CAATTAAAAA AC G AAT G CG A TATTCAGGAC 13 74 0 

CAACTACTTG ATTTTTCACA AGGTCCAAAC CTACTGAACG TAGTAACAAG CCACACTTTT 13 800 

GTCGTACGCG GTAGCCTGTT GCGATGGAAA TATACTCTTT TTGTGTAAAT TCGTTAAAGC 13 8 60 

TTTGATTACC TTGTAGTAGA AAGAAGCGGA GTATTTTTAA AATAGTTGAT TGGTTATAAA 13 92 0 

GCTGATGGAA GT AAT AAT TC GTTTGATGAG AATGGTGTTC G ATT AAT TG A ACTTGTTGCG 13 980 

TATCTAAATT AAAT GT C AAC TCTTCCTCGA ATGTTTCTTG T AATT C C TG C AAAATGCTTA 14040 

GGAGACTTTT AGATTGTAAT GAAGTTAAAG TAGACAGTTC ATCTAGTTCA ATAGACCGAA 14100 

TATCCAATAA TATATTTAAA ATGGTAATTT TATCTGTAAT TCTTTTTTCA ATGTATTTGT 14160 

TTAGCATAGT TACCGAATCT TAGTTGCATA TAGATAATTT TAATTATTAT AATACAAAAG 142 2 0 

AAACTAATTG TCTTGTCAAA AAGGT TGTGG AATTTCCGAC TTTATTGATA AAAC AG C AT G 14 2 80 

TAATAAAAGG CAT T T T AAAG ATAGTAATGA GTATTGGTGG AGTTTTATGG CTTATTTTTT 14 340 

TTATTAGAAA ATATTTTTTT ATCAAATATT GTCGTTCTAT AAAAAAATAT GTGATAAAAA 144 00 

TATCTATTGT GATGGAAGTT GTTTTAATTT ATACTAGGAT AGTTAATAGT AATACTATAC 14460 

TATACTATAT TGTATACAAG TGTGTCATTG CCAGGTTGAG AAGATAGCTA TAACGCACTT 1452 0 

TTATACGCTT TTGCTACGTT TGTTAGTGAA CGGATTAACT CAGTGAGATA AATTTT AT C A 14 580 

GAACATAAGT AATCCGTTTC TTCGTGTATA C AG AT TG AAA GTACCTATGA AT C AT AG AAG 14 640 

GATTAACTTG TTCTATGAAT AATGCT T AAC AGGGAGACAC ACATGAAAAA AGTAAGAAAG 147 00 

ATATTTCAGA AGGCAGTTGC AGGACTGTGC TGTATATCTC AGTT G AC AG C TTTTTCTTCG 147 6 0 

ATAGTTGCTT TAGCAGAAAC GCCTGAAACC AGTCCAGCGA TAGGAAAAGT AGTGATTAAG 14 82 0 

GAGACAGGCG AAGGAGGAGC GCTTCTAGGA GATGCCGTCT TTGAGTTGAA AAAC AAT AC G 14 88 0 

GATGGCACAA CTGTTTCGCA AAGGACAGAG GCGCAAACAG GAG AAG CG AT ATTTTCAAAC 14940 

ATAAAACCTG GGACATACAC CTTGACAGAA GCCCAACCTC CAGTTGGTTA TAAACCCTCT 15000 

ACTAAACAAT GGACTGTTGA AGTTGAGAAG AATGGTCGGA CGACTGTCCA AGGT G AAC AG 15060 

GTAGAAAATC G AG AAGAGG C TCTATCTGAC CAGTATCCAC AAACAGGGAC T T AT C C AG AT 15120 

GTTCAAACAC CTTATCAGAT TATTAAGGTA GATGGTTCGG AAAAAAACGG AC AG C AC AAG 15180 

GCGTTGAATC CGAATCCATA TGAACGTGTG ATTCCAGAAG GT AC ACT TT C AAAG AG AAT T 15240 

TATCAAGTGA ATAATTTGGA TG AT AAC C AA TATGGAATCG AAT TG ACGGT TAGTGGGAAA 153 00 
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ACAGTGTATG AACAAAAAGA TAAGTCTGTG CCGCTGGATG TCGTTATCTT GCTCGATAAC 153 60 

TCAAATAGTA TGAGTAACAT TCGAAACAAG AATGCTCGAC GTGCGGAAAG AGCTGGTGAG 1542 0 

GCGACACGTT CTCTTATTGA T AAAAT T AC A TCTGATTCAG AAAATAGGGT AGCGCTTGTG 15480 

ACTTATGCTT CCACTATCTT TGATGGGACC GAGTTTACAG TAGAAAAAGG GGTAGCAGAT 15 540 

AAAAACGGAA AGCGATTGAA TGATTCTCTT TTTTGGAATT ATGATCAGAC GAGTTTTACA 15 6 00 

ACCAATACCA AAG AT TAT AG TTATTTAAAG CTGACTAATG ATAAGAATGA CATTGTAGAA 15660 

TTAAAAAATA AGGT AC CT AC CGAGGCAGAA GACCATGATG G AAAT AG AT T GATGTACCAA 15720 

TTCGGTGCCA C T TTT ACT C A GAAAGCTTTG ATGAAGGCAG ATGAGATTTT GACACAACAA 15780 

G CG AG AC AAA ATAGTCAAAA AGTCATTTTC CATATTACGG ATGGTGTCCC AACTATGTCG 15 840 

TATCCGATTA ATTTTAATCA TGCTACGTTT GCTCCATCAT ATCAAAATCA ACTAAATGCA 15 9 00 

TTTTTTAGTA AATCTCCTAA TAAAGATGGA AT AC T AT T AA GTGATTTTAT TACGCAAGCA 15960 

ACTAGTGGAG AACATACAAT TGTACGCGGA GATGGGCAAA GTTACCAGAT GTTTACAGAT 1602 0 

AAGACAGTTT ATGAAAAAGG TGCTCCTGCA GCTTTCCCAG TTAAACCTGA AAAAT ATTCT 16 080 

GAAATGAAGG CGGCTGGTTA TGC AGT TATA GGCGATCCAA TTAATGGTGG ATATATTTGG 1614 0 

CTTAATTGGA GAGAGAGTAT TCTGGCTTAT CCGTTTAATT CTAAT ACT GC T AAAATT AC C 162 00 

AATCATGGTG AC CC T AC AAG ATGGTACTAT AACGGGAATA TTGCTCCTGA TGGGTATGAT 16260 

GTCTTTACGG TAGGTATTGG TATTAACGGA GATCCTGGTA CGGATGAAGC AACGGCTACT 16320 

AGTTTTATGC AAAGTATTTC TAGTAAACCT GAAAACTATA CCAATGTTAC TGACACGACA 16380 

AAAAT ATTGG AACAGTTGAA TCGTTATTTC CACACCATCG TAACTGAAAA GAAATCAATT 1644 0 

GAGAATGGTA C G ATT AC AG A TCCGATGGGT GAGTTAATTG ATTTGCAATT GGGCACAGAT 16500 

GGAAGATTTG ATCCAGCAGA TTACACTTTA ACTGCAAACG ATGGTAGTCG CTTGGAGAAT 16560 

GGACAAGCTG TAGGTGGTCC ACAAAATGAT GGTGGTTTGT TAAAAAATGC AAAAGTGCTC 16620 

TATGATACGA CTGAGAAAAG GATTCGTGTA ACAGGTCTGT ACCTTGGAAC GGATGAAAAA 16680 

GTTACGTTGA CCTACAATGT TCGTTTGAAT GATGAGTTTG TAAGCAATAA ATTTTATGAT 16740 

ACCAATGGTC GAACAACCTT ACATCCTAAG GAAGTAGAAC AGAACACAGT GCGCGACTTC 16800 

CCGATTCCTA AGATTCGTGA TGTGCGGAAG TATCCAGAAA TCACAATTTC AAAAGAGAAA 16 860 

AAACTTGGTG ACATTGAGTT TATTAAGGTC AATAAAAATG ATAAAAAACC ACTGAGAGGT 16920 

GCGGTCTTTA GTCTTCAAAA AC AAC AT C CG GATTATCCAG ATATTTATGG AGCTATTGAT 16980 

CAAAATGGCA CTTATCAAAA TGTGAGAACA GGTGAAGATG GTAAGTTGAC CTTTAAAAAT 17040 
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CTGTCAGATG 


GGAAATATCG 


ATTATTTGAA 


AATTCTGAAC 


CAGCTGGTTA 


TAAACCCGTT 


17100 


CAAAATAAGC 


CTATCGTTGC 


CTTCCAAATA 


GTAAATGGAG 


AAGT C AG AG A 


TGTGACTTCA 


17160 


ATCGTTCCAC 


AAG AT AT AC C 


AGCGGGTTAC 


GAGTTTACGA 


AT GAT AAG C A 


CTATATTACC 


17220 


AATGAACCTA 


TTCCTCCAAA 


GAGAGAATAT 


C C T CG AAC TG 


GTGGTATCGG 


AATGTTGCCA 


17280 


TTCTATCTGA 


TAGGTTGCAT 


GATGATGGGA 


GGAGTTCTAT 


TATACACACG 


GAAACATCCG 


17340 


T AAAGTG T AG 


AAATGATAAT 


ATCTATGTTC 


TGAACGATAC 


TT T T AAG AAG 


TAG C AC TC AA 


17400 


GAAGAGATTT 


AAGTTTACTT 


GGTGAAACCT 


GTTTTATTCG 


TAAGTAAACT 


ATCATTGAAA 


17460 


GGGGAGATGT 


TTTCGAAAAC 


TTGCACAGAA 


AAAGGATTAT 


T AT TGT C ATG 


TGTAATTCAT 


17520 


TACATTGCTC 


ACAGTTGATT 


TTAAGAGATA 


TGAATAAGGA 


GAAATCATGA 


AATCAATCAA 


17580 


CAAATTTTTA 


ACAATGCTTG 


CTGCCTTATT 


ACTGACAGCG 


AGTAGCCTGT 


TTTCAGCTGC 


17640 


AACAGTTTTT 


GCGGCTGGGA 


CGACAACAAC 


ATCTGTTACC 


GTTCATAAAC 


TATTGGCAAC 


17700 


AGATGGGGAT 


ATGGATAAAA 


TTGCAAATGA 


GTTAGAAACA 


GGTAACTATG 


CTGGTAATAA 


17760 


AGTGGGTGTT 


CTACCTGCAA 


ATGCAAAAGA 


AATTGCCGGT 


GTTATGTTCG 


T T TGG AC AAA 


17820 


TACTAATAAT 


GAAATTATTG 


ATGAAAATGG 


CCAAACTCTA 


GGAGTGAATA 


TTGATCCACA 


17880 


AAC AT T T AAA 


CTCTCAGGGG 


CAATGCCGGC 


AACTGCAATG 


AAAAAATTAA 


C AG AAGCT G A 


17940 


AGGAGCTAAA 


TTTAACACGG 


CAAATTTACC 


AGCTGCTAAG 


TATAAAATTT 


ATG AAATT C A 


18000 


CAGTTTATCA 


ACTTATGTCG 


GTGAAGATGG 


AG C AACCTT A 


ACAGGTTCTA 


AAGCAGTTCC 


18060 


AATTGAAATT 


GAATTACCAT 


TGAACGATGT 


TGTGGATGCG 


CATGTGTATC 


CAAAAAATAC 


18120 


AGAAGCAAAG 


CCAAAAATTG 


ATAAAGATTT 


CAAAGGTAAA 


GCAAATCCAG 


ATACACCACG 


18180 


TGTAGATAAA 


GATACACCTG 


TGAACCACCA 


AGTTGGAGAT 


GTTGTAGAGT 


ACGAAATTGT 


18240 


TACAAAAATT 


CCAGCACTTG 


CTAATTATGC 


AACAGCAAAC 


TGGAGCGATA 


GAATGACTGA 


18300 


AGGTTTGGCA 


TTCAACAAAG 


GTACAGTGAA 


AGTAACTGTT 


GATGATGTTG 


CACTTGAAGC 


18360 


AGGTGATTAT 


GC T CT AAC AG 


AAGTAGCAAC 


TGGTTTTGAT 


TTGAAATTAA 


CAGATGCTGG 


18420 


TTTAGCTAAA 


GTGAATGACC 


AAAACGCTGA 


AAAAACTGTG 


AAAATCACTT 


ATTCGGCAAC 


18480 


ATTGAATGAC 


AAAGCAATTG 


T AG AAGT AC C 


AGAATCTAAT 


GATGTAACAT 


TTAACTATGG 


18540 


TAATAATCCA 


GATCACGGGA 


ATACTCCAAA 


GCCGAATAAG 


C C AAATG AAA 


ACGGCGATTT 


18600 


GACATTGACC 


AAGACATGGG 


TTGATGCTAC 


AGGTGCACCA 


ATTCCGGCTG 


GAGCTGAAGC 


18660 


AACGTTCGAT 


TTGGTTAATG 


CTCAGACTGG 


TAAAGTTGTA 


CAAACTGTAA 


CTTTGACAAC 


18720 


AGACAAAAAT 


ACAGTTACTG 


TTAACGGATT 


GGATAAAAAT 


ACAGAATATA 


AATTCGTTGA 


18780 


ACGTAGTATA 


AAAGGGTATT 


CAGCAGATTA 


TCAAGAAATC 


ACTACAGCTG 


GAGAAATTGC 


18840 
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TGTCAAGAAC TGGAAAGACG AAAATCCAAA ACCACTTGAT CCAACAGAGC CAAAAGTTGT 18900 

TACATATGGT AAAAAGTTTG TCAAAGTTAA TGATAAAGAT AATCGTTTAG CTGGGGCAGA 18960 

ATTTGTAATT GCAAATGCTG ATAATGCTGG TCAATATTTA GCACGTAAAG CAGATAAAGT 1902 0 

GAGTCAAGAA GAGAAGCAGT TGGTTGTTAC AACAAAGGAT GCTTTAGATA GAGCAGTTGC 19080 

TGCTTATAAC GCTCTTACTG CACAACAACA AACTCAGCAA GAAAAAGAGA AAGTTGACAA 1914 0 

AGCTCAAGCT GCTTATAATG CTGCTGTGAT TGCTGCCAAC AATGCATTTG AATGGGTGGC 192 00 

AGATAAGGAC AATGAAAATG TTGTGAAATT AGTTTCTGAT GCACAAGGTC GCTTTGAAAT 192 60 

TACAGGCCTT CTTGCAGGTA CATATTACTT AGAAGAAACA AAACAGCCTG CTGGTTATGC 19320 

ATTACTAACT AGCCGTCAGA AATTTGAAGT CACTGCAACT TCTTATTCAG CGACTGGACA 19380 

AGGCATTGAG TATACTGCTG GTTCAGGTAA AGATGACGCT ACAAAAGTAG TCAACAAAAA 19440 

AATCACTATC CCACAAACGG GTGGTATTGG TACAATTATC TTTGCTGTAG CGGGGGCTGC 19500 

GATTATGGGT ATTGCAGTGT ACGCATATGT TAAAAACAAC AAAGATGAGG ATCAACTTGC 19560 

TTAAGTAAGA GAGAAAGGAG CCATTGATGA CAATGCAGAA AATGCAGAAA AT G ATT AG T C 19 620 

GTATCTTCTT TGTTATGGCT CTGTGTTTTT CTCTTGTATG GGGTGCACAT GCAGTCCAAG 19680 

CGCAAGAAGA TCACACGTTG GTCTTGCAAT TGGAGAACTA TCAGGAGGTG GTTAGTCAAT 19740 

TGCCATCTCG TGATGGTCAT CGGTTGCAAG TATGGAAGTT GGATGATTCG TATTCCTATG 19 800 

ATGATCGGGT GCAAATTGTA AGAGACTTGC ATTCGTGGGA TGAGAATAAA CTTTCTTCTT 198 60 

TCAAAAAGAC TTCGTTTGAG ATGACCTTCC TTGAGAATCA GATTGAAGTA TCTCATATTC 19 920 

CAAATGGTCT TTACTATGTT CGCTCTATTA TCCAGACGGA TGCGGTTTCT TATCCAGCTG 19980 

AATTTCTTTT TGAAATGACA GATCAAACGG TAG AG C CTTT GGTCATTGTA GCGAAAAAAA 2 0040 

CAGATACAAT GACAACAAAG GTGAAGCTGA TAAAGGTGGA TCAAGACCAC AATCGCTTGG 2 0100 

AGGGTGTCGG CTTTAAATTG GTATCAGTAG CAAGAGATGT TTCTGAAAAA GAGGTTCCCT 20160 

TGATTGGAGA AT AC CGT T AC AGTTCTTCTG GTCAAGTAGG GAGAACTCTC TATACTGATA 2 0220 

AAAATGGAGA GATTTTTGTG ACAAATCTTC CTCTTGGGAA CTATCGTTTC AAGGAGGTGG 2 0280 

AGCCACTGGC AGGCTATGCT GTTACGACGC TGGATACGGA TGTCCAGCTG GT AG AT CATC 2034 0 

AGCTGGTGAC GATTACGGTT GTCAATCAGA AATTACCACG TGGCAATGTT GACTTTATGA 20400 

AGGTGGATGG TCGGACCAAT ACCTCTCTTC AAGGGGCAAT GTTCAAAGTC ATGAAAGAAG 20460 

AAAGCGGACA CTATACTCCT GTTCTTCAAA ATGGTAAGGA AGTAGTTGTA ACATCAGGGA 2 0520 

AAGATGGTCG TTTCCGAGTG GAAGGTCTAG AGTATGGGAC ATACTATTTA TGGGAGCTCC 2 0580 
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AAGCTCCAAC TGGTTATGTT CAATTAACAT CGCCTGTTTC CTTTACAATC GGGAAAGATA 



20640 



CTCGTAAGGA ACTGGTAACA GTGGTTAAAA ATAACAAGCG ACCACGGATT GATGTGCCAG 



20700 



ATACAGGGGA AGAAACCCTT GTATATCTTG ATGCTTGTTG CCATTTTGTT GTTTGGTAGT 



20760 



GGTTATTGTC TTACGAAAAA ACCAAATAAC TGATATTCAA T GT AC AT CAT TATGAATAGG 



20820 



ATAGCAGGCT GAAGGGAAGA C C AG AGT ACT CTGAGGTGAT GTTAATCAGG AATCATGGTG 



20880 



ATGTGGCATG AAT CAT C AAT AACGGATATG AGGCTGGGCA GATTGTGCCA GCCTCATTGT 



20940 



GGGTTATTGT TTGTAAAACG ATAGGACTGG TCTGGTAATC ATTTTA 



20986 



(2) INFORMATION FOR SEQ ID NO : 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21040 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 
CCCAGCAAAA AGCCATCCGA AGATGACTTT TTTGCTATTT AATTTCTGTA TAAGTTACTT 60 

CCAAGCCACG CTTAACAGCT GGACGATTGG CAATTTTTTC TGCCCATTTT ACTAGATTTT 12 0 

GATAACTTGA GGCATCCAAG AATTTTGCAG AACCTTGGTA AAGATTTCCT TGAACTAACT 180 

GTCCATACCA AGACCAGATA GCAATATCTG CAATCGTATA GTCATTGCCT GCAATATAAG 240 

GTTTCTGAGC CAATTCCTTA TCCAATAAAT CCAACTGGCG TTTCACTTCC AT CGT AAAAC 3 00 

GGTTAATAGG ATATTCCAAT TTTTCAGGAG CATAATTGAA GAAATGTCCA AATCCCCCAC 3 60 

C T AG AAAAGG TGCTGCACCT GCTTGCCAGA ATAGCCAATT CAAAACTTCT ACCTTTTCCA 420 

CAGGATTACT TGGTAAAAAG GCTCCAAATT TCTCAGCAAG GTAAAGAAGA ATATGAGCAG 4 80 

ACTCAAAGAC TCTTACGTTT TCAGTACCTG ACTGGTCCAA TAAGGCTGGA ATCTTGGAAT 54 0 

TTGGATTGAG CTTCACAAAG TCTGATCCGA ATTGATCCCC AT C C ATG AT A GCAATCTTAT 600 

ACAAGTCGTA AGCCGCTTCC T T AAAACC AG CTTCTAGTAA TTCTTCCAAT AAGATAGTAA 660 

CCTTCACACC ATTTGGTGTT CCCAGTGAAT AAAGCTGAAA AGCTTGTTCT CCTTTTGGCA 72 0 

AGTTTTGTTC GAAACGGGCA CCTGCTGTTG GTCTGTTTAG CCCCGTAAAA GCTCCTTGAT 7 80 

TACTAGCTTC ATCCTGCCAT ACGGT CGGT A ATTGATATGC TGACATCCGA AACCTCCCTT 84 0 

AAATCGCATT CTTGTCAAAA CCGAGTTTGC GT T G AAT AAA CTTAACGATT TCGACGATGA 9 00 

TAATCATTGA GAAGCTTCCA GCCATAACAA TTCCCCATTG TGACAAGTCT AGTTTGGTTA 960 

CGTGGAAGAT TCCTTCAAGC GGTTCTACAA CGATTGTTGC CATGAGAAGG ATAAAGGATA 102 0 
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CCAAGATGGA CCAGTTAAAG GTCTTAGACT TGAATGGGCC AACTGTCAAG ATGGATTGGT 10 8 0 

AG AC AG AC TT GACATTGTAG GCATGGAAGA GCTGAATCAA ACCAAGGGTT GCAAAGGCCA 1140 

TCGTTAGGGC ATCTGCATGA ATAGCATGAT TGTCACCCAC ATGAACTGGG TAAGCAATCG 12 00 

CAAGGCCATA AACACTCATA ACAAGAGCTG CTTGGAGTAC ACCTTGATAA ATGATAGAAC 12 60 

T C AAAAC AC C AC CTG AG AAG AAGCTTGCCT TGCGTCCACG TGGTTTATGA TTCATGACAC 13 20 

CAGGTTCCGC AGGTTCAACA CCAAGAGCGA TAGCTGGGAA GGTATCCGTT ACCAAGTTGA 13 80 

TCCACAAAAG ATGAACCGGC TGTAAGACAT CCCAACCAAA CAAGGTTGAT AGGAAGATGG 144 0 

TTAATACTTC AGCAGTATTA GCAGAAAGTA GGTACTGAAT AGTCTTTTGA ATGTTTGAGA 15 00 

AGACCTTACG TCCTTCTTCC ACTGCGACGA TAATAGTCGC AAAGT T AT C A TCTGCAAGAA 15 60 

TCATATCAGA AGCCCCCTTA GAAACCTCTG T AC C AGTG AT TCCCATACCG ATACCGATAT 1620 

CGGCTGTTTT CAGAGCTGGC GCGTCATTGA CACCGTCACC TGTCATGGCA ACGACTTTAC 1680 

CTTGTTTTTG CCAAGCCTTG ACGATACGAA CCTTGTGTTC TGGAGACACA CGGGCATAAA 174 0 

CAGAGTATTG ACCAACGACT TTTTCAAATT CTTCATCTGA CAGTTCATTG AGTTCAGCAC 1800 

CAGTTAAAAC GTGACCTTCT GTATCGTTTG CGTCAATGAT TCCCAAACGT TTGGCAATGG 18 60 

CTTCCGCTGT GTCTTGGTGG TCACCTGTAA TCATAATTGG ACGGATTCCC GCTTCCTTAG 192 0 

CCACACGAAC AGCCTCAGCG GCTTCAGGAC GTTCAGGGTC AATCATCCCA ATCAAACCAG 1980 

TAAAAATTAA ATCATTTTCA AGCTCTTCAG AAGTGAGATT TTCTGGAATA CTATCGATAA 2 040 

TCTTATAAGC ACCTGCAAGG ACACGCAAGG CTTGATGAGC CATTTCAGAA TTGTTTGTAC 2100 

GAATGAGATT TGTAACCTTC TCATCAATCG GAGCAATATC CCCAGCCTTA TCACGAAGAA 2160 

GACAACGTTT TAAGAGTTGG TCTGGCGCAC CCTTGACTGC TACAAGGAAA CGACCATCTG 22 2 0 

GCAATGGGTG AACTGTTGAC ATGAGCTTAC GGTCAGAGTC AAATGGCAAT TCAGCTACAC 2 2 80 

GAGGATATTT CTCTAAGAAA CCTTTGACAT CATAGCCCTT GTCCAAGGCA TATTGGATAA 2340 

AGGCTGTTTC GGTTGGGTCA CCAATCAAGT TACCTTCCAC ATCGATTTTC GTATCATTGG 2400 

CCAAGACAAC TGAACGAAGT AGTGGCATTT CAAGACCTAG TTCAATATCA TCAGCTGAGT 2 4 60 

CATGTAGAAC CGCATCGTAG AAGACTTTTT CGACTGTCAT CTTGTTCATA GTCAGCGTAC 2520 

CAGTCTTATC AGAAGCG AT G ATTTCAGTTG AACCAAGTGT TTCAACTGCT GGCAACTTAC 2 580 

GAACGATGGA ATGTCGTTTG GCCAAAACTT GAGTACCAAG AGAAAGAACG ATGGTAACGA 2 64 0 

TAGCAGGAAG TCCTTCTGGA ATGGCTGCAA CGGCAAGGGC AACAGAAGTC AACAACTCAC 2 7 00 

CAAGTGGATT TTTCCCTTGA ATGAAGACAC CCACTACAAA AGTAACAAGG GCAATGACCA 27 6 0 
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AGATAGCATA GGTCAAGACC TTAGAAAGGT TGTTCAAATT TTGTTTGAGT GGTGTATCAG 2 82 0 

TCTCATCCGC ATCTTGAAGC AT AC C AG C AA TATGACCAAC TTCAGTGTAC ATACCTGTAT 2 8 80 

TGACAACAAC ACCCATCCCA CGACCATAGG TTACGTTTGA GTTTTGGAAG GCCATGTTGA 2 940 

CACGGTCACC AATACCAGCA TCTGTCGCAA GCTCGACTGA CAAGTCTTTT TCGACTGGTA 3 000 

CAGATTCACC TGTCAAGGCT GCTTCTTCAA TTTTAAGAGA GTTGGCTTCT ATCAAACGTA 3 0 60 

GGTCCGCTGG T AC C ACGTC A CCTGCTTCAA GGGCAACGAT ATCGCCTGGT ACCAATTCTT 312 0 

TAGAGTCAAT CTCTGCCATG TGTCCATCAC GAAGAACGCG GGCAACTGGA CT AG AC AT GG 3180 

ATTTGAGGGC TTCAATAGCT TCTTCAGCTT TTCCTTCTTG GTAAACACCA AAGGCAGCGT 3240 

TGATGATAAC CACAGCTAGG ATGATAATGG CATCTGCGAT ATCTTCCCCA CCAGAAGTCA 3 3 00 

CGACTGACAA GATTGctGCC GCAACTAGGA TGATAATCAT CAAATCCTTA AATTGCTCGA 3 3 60 

TGAATTTGAC CAAGATTGAT CGTTTCTCGC CTTCTTCGAG TTCATTGTGC CCAAATTCGG 34 2 0 

CAAGGCGCTT TTCCGCCTCA CTTGATGACA AACCTTGCTC GGTCGCATCC ACAGCCTGCA 3 4 80 

AGACCTCTTC AGGGCTCTGA GTATAAAACG CTTGGCGTTT TTGTTCTTTT GACATGTGTC 354 0 

TCCTCCTTGA CATTGTGTGC AAAACAGACT CTCTTTCTGT CATAGCTTTT CACGACAAAC 3 600 

AAAAAGAAAC CTGTTAATCA TAACAAGTCT CGCTGTTTAA GATAGGGCCG G AAAG CAT AC 3 660 

TTTTCAGCAT AAAATT CGG A AT G ACG AC AC TATCACAGGT TTCTGCCAGC TACTCCCTTG 3 72 0 

AGTAGTACCA TTATACCAAA TTTTGGGGAG TT T T C AAAG A GTAAAAACTG CCTTATTTGA 3780 

ATTTTTCCTT GAAAACCAGT ATAATGGTAG AATGCTATGT GACTAGAAAG GAAGTTGAAT 3 84 0 

GAAGCAATCT ATCTCAAATC TCAAGTTAGC TGAGCGTGGA GCCATTATCA GTATTTCGAC 3 90 0 

CTATTTGATC TTGTCTGCAG CCAAATTAGC AGCTGGTCAT CTCCtTCATT CATCCAGTTT 3 9 60 

GGTGGCCGAT GGTTTTAATA ACGTATCGGA CATCATTGGA AATGTGGCCC TCTTAATCGG 4 02 0 

GATTCGGATG GCGCGCCACC TGCAGACCGT GACCACCGTT TTGGTCATTG GAAGATTGAA 4 08 0 

GATTTGGCAA GCTTGATCAC TTCTATCATC ATGTTCTATG TCGGTTTCGA TGTTCTAAGA 414 0 

GAT AC C ATT C AAAAGATTCT CAGTCGGGAA GAAACGGTCA TTGATCCTCT TGGTGCAACT 42 00 

CTAGGAATCA TTTCTGCAGC GATTATGTTT GTGGTCTATC TCTACAATAC TCGCCTCAGT 42 60 

AAGAAATCCA ACTCCAATGC GCTGAAGGCA GCTGCTAAGG ACAATCTTTC TGACGCTGTT 4320 

ACCTCACTTG GAACCGCCAT TGCCATCCTA GCTAGTAGTT TCAATTATCC GATTGTGGAT 43 8 0 

AAACTGGTTG CT AT CAT CAT CACTTTCTTT ATCTTGAAGA CTGCCTATGA TAT CTT CATC 4440 

GAGTCTTCCT TTAGTCTTTC AGATGGCTTT GACGACCGCC TGCTCGAGGA CTACCAAAAG 4 500 

GCTATCATGG AAATTCCCAA AATCAGCAAG GTCAAATCGC AAAGAGGTCG CACCTACGGT 4 560 
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AGCAACATCT ACCTGGATAT TACACTAGAG ATGAATCCTG ACTTGTCTGT TTTTGAAAGC 4 62 0 

CAT G AAAT CG CGGATCAGGT CGAGTCTATG CTGGAGGAGC GTTTTGGCGT CTTTGATACC 4 6 80 

GATGTCCATA TCGAACCAGC ACCTATCCCT GAGGATGAAA TTTTAGACAA TGTCTATAAA 4740 

AAATTGCTTA TGCGTGAACA ATTGATTGAC CAAGGAAACC AACTAGAAGA ACTCTTGACT 4800 

GATGATTTTG TCTATATTCG CCAAGATGGA GAGCAGATGG ATAAAGAGGC TTATAAGACC 48 60 

AAAAAAGAGT TAAATTCTGC TATCAAGGAC ATT C AAAT T A CTTCCATCAG TCAAAAAACC 492 0 

AAACTCATCT GCTATGAGTT AGATGGTATC ATCCATACCA GTATCTGGCG TCGCCACGAA 4980 

AC C TGGC AAA ATATCTTTCA TCAAGAAACC AAAAAAGAAT AGAGAAATCC TTTCATGAGA 504 0 

CGGGATTTTT CTATTCTTTT ATACTCAATA AAAATCAAAG TGCAAATTAG GAAGCCGGTC 5100 

ACAGGCTGTA CTTGAGTCGG CAATGTGAAG CCGACATAGT TTGCACTTTG ATTTTCGAAT 5160 

AGTCTTAACT ATCAAATTCA CTGAGATACT CATAGCGTTC GTATTTTTCA AGGAGTGCTT 52 2 0 

CATTTTTCTC ATCCAATTCT TTTTGGAGAG TAGCCAGCTT ACCAAAGTCA GAGCCGTTAG 52 80 

CCTGCATTTC CTCTTCAATA GCAGCGATAC GTTTTTCCAA GGTTTCAATA TCACCTTCAA 53 4 0 

TACTTGCCCA CTCCTGCTTT TCTTGGTAGG TCATGCGTTT CTTGTCTTCT CGAACCTTGA 5400 

CCACTTTTTC CTTTTCGGCC TTTTGCACTT GATTGGCCAT ATCTGTTTCA AAAGCTTTTT 54 60 

CATCAAGATA GTCGGTGTAA TGACCAAAGA AAGGACGAAT CTTGCCATCC TCAAAAGCGA 552 0 

GAATCTTGGT CGCTACCTTA TCCAAGAAAT AGCGGTCGTG ACTGACTGTT AAAACGGGAC 5580 

CTGCAAAACC TTGCAAGAAA TTCTCTAAGA CTGTCAAAGT TGCAATATCT AGGTCATTGG 5 64 0 

TTGGCTCGTC TAAAAGAAGA ACATTTGGTT TTTCCAAAAG CAGTTTGAGG AGATAAAGAC 570 0 

GTTTTTTCTC ACCCCCTGAC AATTTCTCAA TCAAAGTCCC ATGCGTCGAA CGTGGGAAGA 57 60 

GGAATTGCTC CAGCAACTCA GCGATGGAAG TCGTAGAACC ACCACTGGTC TTGACCTCCT 5820 

CTGCCACTTC CTGCAGGTAA TTGATCACAC GCTTGCTTTC ATCCAAACCC TCAATTTGTT 5880 

GAGAGAAATA GGCGATGCGA ACAGTTTCCC CAATCACAAC TTGTCCTGCT GTCGGCTCAA 5940 

GACTTCCTGC AATCAGGTTA AGTAGGGTTG ATTTTCCAAC AC CAT TGT C C CCAACAATTC 6000 

CAATACGGTC TTTAGCCTGA ACTAAGAGAT TAAAATTTTG CAAAATGGGC TTATTTTCAT 6060 

AGGCAAAGGA AACATCCTGA AACTCGATGA CTTTCTTCCC AATCCGACTG GTTTCAAAGT 6120 

TCATAGTCAA GTCTGTCTCA GCACTACTGC CTGAAACTTC CTTTTTCAGA TCATGGAAAC 6180 

GATTGATACG AGCTTGTTGC TTGGTCGCAC GCGCCTGCGG TTGTCTGCGC ATCCAGGCCA 6240 

ATTCTTGTTT GTAGAGTTGT TCTTTTTTGT GAAGAAGAGC CGCGTCGCGC TCATCCTGTT 6300 
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CCGCCTTTAG GCGAACATAG TCCTGGTAAT TTCCCTGGTA CTCGGTCAAG CCTGCACGAT 6 3 60 

CCAACTCGAA AATCCGTGTT GACAAAGCGT CTAAGAAATA ACG AT CGTG A GTGATAAAAA 642 0 

GGACGGTCTT CTTAGAATTT TT C AAAAAG A GGGTCAGCCA CTCAATAATC GCAATATCCA 64 80 

GATGGTTGGT CGGCTCATCC AAAAGCAAGA GGTCGTGGTT GCCAAGTAAG ACTTGTGCCA 6540 

ACTGTACCCG TCTTCTCAGA CCACCTGACA ATTCCCCAAC AGGAGTAGAT AAGTCTTGAA 660 0 

TGCCCAATTT GCTAAGAACG GTCTTGACCT GACTTTCGAT TTCCCAAGCT TGGAGAGAGT 6 6 60 

CCATCTCTGC CATGACACGT TCCAAACGCG CCTGCTTGTC CTCACTATAG TCGAGCATAA 672 0 

TCAATTCATA CTCACGAATG AGCTGGATTT CCTTGAGTTC ACTAGATAGA ACCGTATCCA 67 80 

AAACTGTCTT TCTATCATCA AAATCAGGAT CCTGAGTCAA GTAACCAATC TGGT AAT CAT 6 84 0 

TTTTAGCTGA AAAAGGACTG ACATCCCCAT CAAATCCAGA AACACCAGAA AGGACGTCCA 6900 

AAAGGGTGGT CTTGCCAGTC C C ATT G AC AC CGATTAAACC AATTCTGTCT AAGTCATGGA 6960 

TAATAAAGGA AATATCCCTA AAAACGGTCT TGTCACCAAC GGATTTACTT AGTTTTTCAA 7020 

CGATAAAATC ACTCATTTTT TCTCCCTCAG GTAAGCATGG ATGGCTTCAC GATTATTCTC 7 080 

CAATTCTCCA TCGACAATGG CAAACTCAAT CTCTGTTAAA ATCTCTCCCA AGTCTGGGCC 7140 

TGGCTGATAG CCATATTCCT TGATCAAAAT AC C GCC ATT A ATCTGAATCT CTTTCTTGTC 72 00 

ATGGATAGTC AAGCTTTGGT ATTTTTCTGT GATGGCTTGT GGGTTGACTT CTTTTCCTTG 72 60 

AGCTTGACGA AGATTTTCAG CCTGTAAAAG CAAATCTATG TCAAAGCGAT AACAATCTCG 732 0 

CTTGCTCAAT TCTCCATTTT CACGCAGAGC CAAAATAATC AGCAAATCCT GAACTTGCTT 7 3 80 

GGCAAACTGG CGTGAGGTCT TCCAAGATTT CAAAAATGAC TGCGCATTTT CAATCTCCAA 744 0 

AGCCCATAGT AAAGCCGCCC AGGCTTGTTC AG AGG ATT C A AAAGTAAAAT CAGTCTCCAA 7 500 

ATCAAACAGT CTGTTGAGCT TGTCCTGGCT AGATGCCATA TCAGGGAGAT AGTCATAAGC 7 5 60 

TTGACTCTCA ATCATGGAAG CCAAGCCCCT TCTCCAAAAT GGAGCCAGCA AGAGTTTATC 7 62 0 

AAACTCGACG AAGGTACGCT CTACAGAAAT TTTCTCCAAA AGCGGCGTCA AGGTCTTCAT 7 68 0 

AGCTTTAAAT GTTTCTGGCT CAAGTGCAAA ACCAAGACTA GCCTGAAAAC GGAAACCACG 774 0 

CATAATCCGT AAAGCATCTT CGTTGAAACG CTCACTAGCC ACTCCAACTG CTCGCAAGAC 7 800 

TTGCTTTTCC AAATCTTCTA AACCATGGAA CAAGTCAACG ATTTCTCCTG TCTCATCCAA 7860 

GGCAAAGGCG TTGACTGTGA AATCACGGCG TTTGAGGTCT TCTTCTAGCG AT CGT AC AAA 7 92 0 

GGAAACCGCA CTGGGTCTGC GATAGTCCAC AT AG AC AT C C TCTGTCCGAA AGGTTGTTAC 7 9 80 

CTCATACTCC TCATCCCCAT CTAAGACCAA GACGGTTCCA TGCTCGATTC CGATATCGGC 804 0 

TGTTCGCGGA AAAATCTGCT TGGTCTCTTC TGGATAAGAA GACGTCGCAA TATCCACATC 8100 
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GTGGATAGGG CTATGGAGAA GGGCATCTCG AACAGAGCCC CCAACAAAAT AAGCCTCAAA 816 0 

GCCTGCTTCT TTAATTTTTT CTAATACTGG TAAAGCCTTC TGAAATTCAG AAGGCATTTG 8220 

CGTTAATCTC ATAATAAGTG TTCTAATCCA TAGACAAGCT CATGACGCTT GACAACTTCT 82 80 

TTAATTCCCA AATTGACTCC TGTCATGAAG GAGATGCGAT CATAGGAGTC ATGACGGAGG 83 40 

GTCAACCCTT CTCCCTGATT GCCAAAGATG ACTTCCTGAT GAGCTACCAA GCCTGGCAAA 8400 

CGAACTGAGT GGATGCGCAT AC CAT C AAAG TCAGCACCAC G AG C AC C AG C AATCAGCTCT 8460 

TCCTCATCTG CTGCACCTTG CTGAATTGAC TCTCGAACCT CTGCCATCAA CTCAGCTGTT 8520 

TTAATGGCTG TTCCACTCGG AGCATCCTTT TTCTTGTCAT GATGGAGCTC AATAATCTCC 85 80 

ACATTTGGGA AATATTTGGC AGCCTGCGTC GCAAATTGCA TGAGTAAGAC AGCACCCAAG 8 640 

G C AAAGTT AG GGGCAATCAG GCCACCCAAG TCTTGGGCAC GAGAAAATTC TTTTAGCTCT 8700 

GCAATTTCTT CACTCGTGAA ACCAGTCGTT CCAACTACTG GAG C AAAG C C ATTTTCAAGA 87 6 0 

GCAAAACGTG TATTTTCGTA GGCAACAGCT GGAGTAGTAA AATCTACCCA GACATCCGCT 882 0 

TCAAAACCAG CTAAATCAGC CTTATCCTTG AAAACAGGAA TACCCTGCCA TTCTGACTCA 88 80 

GACTCAAAAG GATCCAAAAC TGCCACCAAG TCCAAGTCTG GATCAGTCAA TACCATCTGA 8 940 

CAAGCAGCCT GGCCCATCTT TCCCTTAAAA CCGGCAATAA TTACTCGAAT ACTCATCTCT 9 00 0 

ACTCCTGTCT AAGATACAAA GTCCGTAAGA ACACAAAGTG AAAATAGGAA TTCCAATCAA 9 0 60 

GAAGTGTCTA CTTCTTGGAA GAACTATCTT TTTCACACAG GGTTCCAGGC GTGTTCAATT 912 0 

AT C AAG AT AC AAAGGACCTT AGCTGCCTCT GAAAAATAGG GAATGGCACT GACTTTCCAC 9180 

GAAAGGCAAG ACAGGCATCT TTTTTCAAGA GGCAGGTAGT CCGTGTTCAA TTTCTAAGAT 924 0 

AC AAGGC AT C TTAACTAGCC TAGAAGCGCC AACTAAATCA CTGGAATATA ACCCAGAGCA 93 00 

ATACTTCCTG CTCCTAGGTG CGTTCCAATG ACACTACCAA ATGTAGCAAG TGAAACATCC 93 6 0 

GAACCCAAGC CAAAATCAAG CAAGTGcTGA CGCAATTCTT CAGCCTTTTC AGGAGCATTC 942 0 

CCATGAATGA CAATGACCCG GTATTGACCT GAAGCCGTTG TTTCCTTGAT AATTTCAATT 94 80 

AAGCGCTTGG TGGCCTTCTT TTCAGTACGA ACTTTTTCGT AAACTTCAAT CACACCTTGA 9540 

TCGTTAAAAT AAAGGATTGG CTTAATGCTA AGCAAATTGC CCAAAATGGC AGCCCCATTT 9 600 

GAAAGGCGTC CACCTTTTAC C AAATG AT C C AAGTCATCTA C CAT GAT AAA GGCTGACGTA 9660 

CGGCTGATTT GAATGGCTAG CTTATCCTGA ATGCTGGCAA AATCATCGCC CTGATCACGC 972 0 

CAATTAAAGA CGCTTTCAAC CATGATGCCT AGGGGAGCAC TTGTAATCAA AGTGTCTGGG 97 8 0 

AAAGCAATGG TTAAGCCCTC ATAGTCATCG ACCATATACT GGATATTTTG GTAAAAACCT 9840 



WO 98/18931 



PCT/US97/19588 



496 



GAAATTCCAG 


AAGATAGGAA 


AAGCCCCAAG 


GCATGTGTAT 


AGCCTTGTTC 


TTTGAGCGAA 


9900 


GTTAAGATCT 


CAT CT AACT T 


GGCAATACTT 


GGTTGACTGG 


TCTTAGGCAA 


TTCAGAAGCC 


9960 


TGAGCCATTT 


TTTGGTAAAA 


TTCCTCAGCA 


GACAGATTGA 


TGCCTTCGAC 


ATATTCCTCA 


10020 


CCATCAATAT 


TGACAGGAAT 


ATCCAAGACA 


AACAAGTCTT 


CTCTTTGCAA 


GATCTCTGCA 


10080 


CTGAGATAAG 


CAGAGGAATC 


TGTGAAAACA 


GCTAATTTCA 


TATTAGAACT 


CCAAATTAAT 


10140 


TCCTGGTAAG 


TCTAATGCAA 


TTTCAGTCAC 


TTCGTAAGTC 


AAACGATTGA 


GCATGTTCAA 


10200 


AC ATGGAC G A 


GCCAAGGTTT 


CCACCTCTTC 


TTGGTTCAAT 


TCACTTGGTT 


CATTGACAAT 


10260 


ACGGCCATCG 


ATATGGTTTA 


CTTGTGAGAT 


TGTTCCACTA 


ATGACAAACT 


TATCAAATAC 


10320 


AATCATAAAG 


CTCAAGATGA 


CAATCAAGGA 


AGTCACTTGA 


TTTTCTTGGT 


CATGTTGGAG 


10380 


CAATTGGAAA 


TTCACATCCA 


CCTTGGTTTC 


AGGAGCTCCA 


TTTTCATTTT 


C CC ATT C AAA 


10440 


ATTACGCGCA 


T C AAAATG AT 


ACTGACTAAC 


AAATTCTTGT 


TCACGTTTAA 


GATTCATGTC 


10500 


TTTCTCCATC 


GGCTACAATA 


TTATAAGCTA 


TTGTACCATA 


ATTTTTTATT 


TTCATCTAGT 


10560 


TTTCTAGGAT 


TTAGTCAATC 


CCAATTTCAG 


CACGAACTAC 


ATCTGTGATG 


GTATCAACAT 


10620 


AGTAGTTTAC 


TTCTTCTGTT 


GTAGGCGCTT 


CTGCCATAAC 


ACGCAAGAGG 


GGTTCTGTTC 


10680 


CACTTGGACG 


AACAAGGATA 


CGGCCGTTCC 


CCGCCATTTC 


TTCTTCCATC 


TTCTCGATGA 


10740 


TGGCCTTGAT 


AGCTGGCACT 


TCCATGGCCT 


TTTCCTTCAT 


GACGTTTTCC 


ACTCGGATAT 


10800 


TAACTAATTT 


TTGTGGATAA 


ATCGTTACTT 


CTGCCGCCAA 


CTCTGATAAG 


CTCTTACCAG 


10860 


TTTCCTTCAT 


GATTTTAGTC 


AATTGAACTG 


CTGATAATTG 


ACCATCACCT 


GTGGTATTGT 


10920 


AATCCATCAA 


GATAACGTGA 


CCAGACTGTT 


CACCACCAAG 


GTTGTAGCCT 


GATTTTCTCA 


10980 


TTTCTTCAAC 


AACGTAGCGG 


TCACCAACTG 


CAGTAACTGC 


CTTGTTAATA 


CCTTCGCGAT 


11040 


TCAAGGCCTT 


GTGGAAACCA 


AGGTTAGACA 


TAACAGTTGT 


CACAATTGTA 


TTTTGAGCCA 


11100 


ATTGTCCTTT 


TTCAGAAAGG 


TATTTTCCGA 


TGATGTACAT 


AATCTTGTCA 


CCATCAACGA 


11160 


TGTCACCATT 


CTCATCAACA 


GCAATCAAGC 


GGTCACTGTC 


TCCATCAAAG 


GCCAAACCAA 


11220 


TAGCTGACCC 


ACTTTCTTTG 


ACCACTTCTT 


GAAGGGCTTC 


TGGATGTGTT 


GAACCAACAT 


11280 


TAAGGTTGAT 


GTTAAGACCG 


TCTGGTGTTT 


CCCCGATAAC 


CGTCAATTGG 


GCACCAAGGT 


11340 


CTGCAAAGAT 


TTGACGGGCA 


CTGGTAGAAG 


CTGCTCCATT 


AGCTGTATCC 


AAGGCAACCT 


11400 


TCATTCCATC 


AAGAGGAGTT 


CCAGTTGAAA 


CAAGGTATCC 


TTCATACTTA 


CGCArGCtTC 


11460 


TGGATAATCT 


ACC AAAATT C 


CTAAGCCTTC 


TGCACTTGGA 


CGAGGAAGAG 


TGTCTTCCTC 


11520 


AGCATCTAGC 


AAGGCTTCAA 


TTTCTGCTTC 


TTTTTCATCA 


TCTAGTTTGA 


AGCCATCACC 


11580 


GCCAAAGAAC 


TTGATTCCGT 


TATCAAGGGC 


TGGGTTGTGG 


CTAGCAGAAA 


TCATGACACC 


11640 
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GGCACTTGCT 


CCTTCAGTTT 


CAACCAAGTA 


AGCTACTGCT 


GGTGTTGCAA 


GGACACCAAG 


11700 


TTTGTATACG 


TGAATCCCTA 


CTGAAAgGAG 


ACCTGCCACC 


AAGGCCGATT 


C C AAC ATT T C 


11760 


CCCTGAAATA 


CGTGTGTCAC 


GTCCTACAAA 


GACTTTCGGC 


GCTTCCGTTT 


CATGTTGACT 


11820 


AAGAACATAG 


CCTCCAAAAC 


GTCCTAGTTT 


AAAGGCTAAT 


TCTGGTGTTA 


GTTCTAGGTT 


11880 


AGCTTCTCCA 


CGGACTCCAT 


CAGTCCCAAA 


ATATTTACCC 


ATTGTTATAA 


AATCCTTTTC 


11940 


TATTTTTAAT 


TCGTTTTTGA 


ACTAGTTGCT 


TTCGTTGACG 


AAGATGTCTC 


CGATGAACTG 


12000 


CTTGTACTTG 


AATTTGATGT 


GCTTGAACTT 


GGTGCTACTG 


GTTTTGTAGT 


CACCTTCATT 


12060 


ATTGTATCAA 


ACGGAGTGAT 


AACTGCCGGT 


AAGACAACAC 


CATTGCGGTC 


GATTGCCTGC 


12120 


AAAGGTACTG 


AACCACTGTA 


ATTACCTGTT 


ATACGTTCGC 


TAGTTGGCAA 


AAC AG CG AT A 


12180 


ATCTTATCAA 


TTCTATCCAA 


TGTCTCTTGG 


TCACTCGTAA 


TAGACACTTC 


TTTATCTGAC 


12240 


AC C ATG AC AT 


TTTCAATTTG 


TACCCGACTA 


TCAATTTGAC 


TAGGGTCAAT 


CTCTGGTACA 


12300 


ATCTTTACCT 


TATCCTTCTG 


AGCCTTCTTA 


CCAATCTTGA 


CTGTAATTTT 


TTGCGGAGTC 


12360 


GCCACAGCGG 


TCAGCCCATT 


GGGTAAATCT 


TCAATGCTCA 


AAGGAACTTC 


AATCGTTCCA 


12420 


ACACCGGCAT 


CTGTTAGGTC 


AGCAGTAACC 


TTGAATTTAC 


GTGTACTTTC 


TTGCATTTCA 


12480 


CTAGCTAGCG 


ATAGGCGATT 


TGCACCAGTC 


AAGACCACTG 


ATACTTCTGA 


AGCAAAACCG 


12540 


CTAATAAAAT 


ACTTATCACT 


ATTATAGCGT 


ATGTCAATAG 


GGACATTTGT 


TACTGTATTA 


12600 


GTATAGGTTT 


CCGTTTTTAC 


CTGCCTAGCA 


CTGGTACTGT 


TTTGAAAATT 


CGTCGCCGTA 


12660 


GCATAGACAA 


ATAAGACACA 


AGCAAAAAAG 


AGTGAGGATA 


TGATATATAA 


ACTATTTTTT 


12720 


TTCATGTTTC 


CATCCTCCTA 


GCAATCGTTC 


TTTAAAACTA 


AGACCCACTT 


CCTCTTTTGG 


12780 


AAGTAAGATT 


TCACGTAATT 


CTGTTTCAAA 


TTCATCAAGT 


GTTAGGTTGT 


GCTTAAACCT 


12840 


TCCATTATAG 


GTTATCGAAA 


TTCCTCCCGT 


TTCCTCTGAT 


AC GAC AAAAG 


TCAAGGCATC 


12900 


TGAGACTTCT 


G AT AAACC G A 


TAGCCGCCCG 


GTGTCTGGTC 


CCAAATTCCT 


TGGAAATCCC 


12960 


TGTGTTTTTT 


GTCAAGGGCA 


GATAGGCAGA 


CGTCACAGCG 


ATACGTTCTT 


CTTTGATAAT 


13020 


CACCGCACCA 


TCATGTAGGG 


GAGTGTTGGG 


AATAAAAATG 


TTAATGAGAA 


GTTCTGCAGA 


13080 


AATCTTAGCA 


TCCAAGGGAA 


TTCCTGTCGA 


AAT AT ACT C C 


TGCAAGGTAC 


GTACACGCTG 


13140 


AATAGCAACC 


AAGGCCCCGA 


TTTTACGAGG 


ACTCATGTAT 


T C AAC AG ACT 


TAACAAAGGC 


13200 


ACGAATCATC 


TGTTCCTCAG 


CACTAATAGG 


GGCATTGGAA 


AAGAAATCTG 


TCGCTCTTCC 


13260 


CAAACGTTCC 


AAACCAGTCC 


GAATCTCTGG 


AGAGAAGATA 


ACAACCGCCG 


CAATAACCCC 


13320 


ATAAGTAATA 


ATTTGATTGA 


TTAACCAAGA 


AATCGTAGTC 


AAACCAATCA 


TATTTGCAAG 


13380 
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GATTTGAGCT 


AAAATAAACA 


CCAAAACTCC 


ACGTACCAAA 


ATCATAATCT 


TGGTTCCTGC 


13440 


AATAGCTTTT 


GTAAAATGGT 


ATAAAATATA 


AGCAACAATC 


AAAAT AT C AA 


TCAGATTGAT 


13500 


AGCTATCGTC 


CATGGACTTG 


CAAACAAACT 


GGTCCAATAT 


TGCAGATTGG 


ATAATTGTTG 


13560 


AAAATTCATC 


CCTG AT AT CC 


TCCCTATCAA 


AACACTTTCG 


TCCTATTATA 


CCATTTTCTG 


13620 


GCATTTTTTT 


CCCTATCCTA 


GTCCATTTTA 


CATTGAACAA 


AAATATGATA 


AAATAAACTG 


13680 


ACTAAAAAAA 


ACAAAGGAGA 


AACTATGTCT 


CAACTCTATG 


ATATTACCAT 


TGTGGGTGGT 


13740 


GGTCCTGTCG 


GGCTTTTTGC 


AGCCTTTTAT 


GCCCACCTAC 


GCCAAGCCAA 


GGTTCAAATC 


13800 


ATCGACTCTC 


TTCCCCAGCT 


AGGTGGACAA 


CCTGCTATTC 


TCTACCCTGA 


AAAGGAAATC 


13860 


CTAGACGTAC 


CAGGCTTCCC 


AAACCTGACT 


GGAGAAGAGT 


TGACTAACCG 


CTTGATTGAA 


13920 


CAGCTAAATG 


GATTTGATAC 


CCCTATTCAT 


CTCAATGAAA 


CGGTTCTTGA 


G ATT G AC AAA 


13980 


CAAGAAGAAT 


TTGCCATCAC 


AACTTCTAAA 


GGAAGTCACC 


TGACTAAAAC 


AGTT AT CAT C 


14040 


GCTATGGGTG 


GCGGTGCCTT 


CAAACCACGT 


CCGCTGGAAC 


TTGAAGGGGT 


TGAGGGCTAT 


14100 


GAAAATATCC 


ACTACCACGT 


T T C TAAC ATT 


CAGCAATACG 


CTGGTAAGAA 


AGTGACGATT 


14160 


CTTGGTGGGG 


G AG AC TCGGC 


TGTGGATTGG 


GCTTTGGCTT 


TTGAAAAAAT 


CGCACCAACT 


14220 


ACCCTTGTTC 


AC CGC AG AG A 


TAATTTCCGT 


GCCTTGGAAC 


ACAGTGTTCA 


AGCCTTGCAA 


14280 


GAATCATCTG 


TAAC CAT C AA 


GACACCATTC 


GCCCCTAGCC 


AACTCCTTGG 


AAATGGAAAA 


14340 


ACACTTGATA 


AACTTGAAAT 


C AC AAAAGT C 


AAATCTGATG 


AAACTGAAAC 


CATTGACCTA 


14400 


GACCACCTCT 


TTGTCAACTA 


TGGTTTCAAA 


TCTTCTGTCG 


GTAACCTTAA 


AAACTGGGGG 


14460 


CTCGACCTCA 


ACCGTCACAA 


GATTATCGTC 


AACAGCAAAC 


AGGAATCCAG 


CCAAGCAGGT 


14520 


ATCTATGCTA 


TCGGTGACTG 


CTGCTACTAT 


GACGGAAAAA 


TTGATCTGAT 


TGCGACAGGC 


14580 


CTCGGAGAAG 


CTCCAACTGC 


TGTCAACAAC 


GCTATCAACT 


ACATTGACCC 


TGAACAAAAA 


14640 


GTACAACCAA 


AACACTCTAC 


TAGTTTATAA 


AAAAGAACCA 


CGAGTCACAT 


AGGATTCGTG 


14700 


GTTTTATAAT 


TCATCCGCTA 


TCTTATTGAT 


TTTTCTGAGT 


CTGTGATTGA 


CACCACTTTT 


14760 


GGTCAGAGGG 


GTGCTGAGAC 


TATCTGCTAA 


CTGCTGGATA 


GAGTAGTCTG 


GGTGCTGAAT 


14820 


CCTCAGTTGC 


GCCACTTCCT 


GCAAATCTAC 


TGGCAAATTT 


TCTAAGCCCA 


TGATATCTTT 


14880 


GATTTTACTG 


ATATTGTTAA 


TGGTCTTCAT 


GCTGGCAGAA 


ACTGTCCGAG 


CG AT ATT AG C 


14940 


TGTCTCGGCA 


TTATTAGCCC 


GATTGAGGTC 


GTTACGGGTT 


TCTCGCAAAA 


TCTTAACCCG 


15000 


CTCAAAATCA 


TCACGTGCCT 


GCATGGCTCC 


T ATT ACT AT C 


AAGAAGTCCA 


TAATGTCTTC 


15060 


TGCTCGCTGG 


AG AT AGGT C A 


CAGCCCCCTT 


CTTGCGCTCA 


AGCACCTTGG 


CATCCAGTAA 


15120 


AAACTGTTGG 


AGAAGGGAGG 


CAATTCCTTG 


CGCGTGGTCC 


AGATAAACAG 


AACTGATTTC 


15180 



